Description

Hepatitis B virus surface antigens and hybrid antigens containing them 

FIELD OF THE INVENTION

The invention relates to hepatitis B virus antigens and hybrid antigens containing them and to the cloning of
genes which code for said antigens in yeast by use of recombinant DNA techniques.

BACKGROUND OF THE INVENTION 

A. Hepatitis B vaccines

Infection with hepatitis B virus (HBV) is a serious, widespread health problem. Infection can be manifested in
acute or chronic phases. In the United States, the number of cases of acute hepatitis is estimated to be at least
100,000 per year with a fatality rate of 1 to 2 per cent, and the prevalence of chronic carriers of HBV among
healthy adults varies between 0.1 and 1% depending on age and social class. In South America, the
prevalence of chronic carriers is about 1 to 3%; in the U.S.S.R. and southern Europe, about 3 to 6%; and in
Asia and Africa, more than 10%.

In developed countries, there exists a need for a vaccine for people at high risk of exposure, such as
patients and personnel in medical units where blood is handled, military personnel, spouses of chronic
carriers, travelers to areas of high HBV endemicity, newborns of chronic carriers, homosexuals, prostitutes
and drug abusers. In third world countries, there exists a need for an inexpensive vaccine for mass
immunization. Mass immunization programs may ultimately affect not only the incidence of acute hepatitis and
the pool of chronic carriers but may also reduce the morbidity and mortality from chronic active hepatitis and
hepatocellular carcinoma.

Dane particles, which are believed to be hepatitis B virions and which are isolatable from infected patients,
have a diameter of about 42 nm. Each consists of an envelope comprising the hepatitis B surface antigen
(HBsAg), a capsid (HBcAg), an endogenous polymerase and a DNA genome. The genome is circular and
double-stranded with a single strand region consisting of about 200 bases. The single strand region can be
filled in, in vitro, by the action of the endogenous polymerase. The longer strand contains approximately 3,200
bases.

For a long time, it has been difficult to prepare HBV vaccines because it has proven difficult to propagate the
virus in tissue culture and because the only known host is man. However, chimpanzees can also be
infected in the laboratory with the virus.

Valenzuela et al. (Nature, 298, 347-350, 1982) report synthesis of HBsAg in yeast using an expression vector
wherein the HBsAg coding sequence is an 835 base pair (bp) TaqI-HpaI fragment and the promoter is the yeast
alcohol dehydrogenase I promoter. Several earlier brief reports noted research preceding this reference.

These are Valenzuela et al. (Arch. Biol. Med. Exp. (Chile), 14(1), 21-22, 1981), which reports expression in yeast
of a DNA fragment containing a sequence which codes for a protein similar to HBsAg ligated to a yeast alcohol
dehydrogenase promoter region; a report in Scrip No. 616, p. 14 (Aug. 12, 1981), which states that a team of
U.S. researchers including P. Valenzuela and W. J. Rutter have announced production in yeast of "the
protein-coating surrounding hepatitis B virus"; and Zuckerman (Nature, 295, 98-99, 1982), which reports that
W. J. Rutter has reported expression of glycosylated HBsAg in yeast cells.

Antigenic components of HBV, such as HBsAg, are reported to have been prepared in bacteria following
insertion of a recombinant DNA molecule containing a gene which codes for the antigen. Burrell et al. (Nature,
279, Number 5708, 43-47, 1979) report expression in E. coli HB101 of HBV DNA sequences cloned in plasmid
pBR322.

Murray et al., European Patent Application Publication Number 13,828, 1980, disclose preparation of a
recombinant vector which can code for HBV antigens, including HBsAg, in E. coli strain HB101. The vector is
prepared from Dane particle DNA and plasmid pBR322. The authors state that useful hosts may include other
bacterial hosts, yeasts and other fungi, animal or plant cells and other hosts, although the only host
exemplified in the application is E. coli.

Charnay et al. (Nature, 286, 893-895, 1980) report construction of a bacteriophage carrying a fusion of the
β-galactosidase gene and the HBsAg structural gene. The bacteriophage directs synthesis of a fusion protein
comprising antigenic determinants of both HBsAg and β-galactosidase.

Tiollais et al., UK Patent Application Number 2,034,323, disclose preparation of a coliphage containing HBV
DNA. Fused phage-HBV DNA is transformed into E. coli strain C600.

In UK Patent Application Number 2,070,621, a plasmid which comprises a part of the HBsAg gene and the
promoter and the Z gene of the lactose operon and which can be cloned in E. coli is disclosed.

Rutter et al., European Patent Application Publication Number 20,251, 1980, disclose recombinant vectors
including a recombinant vector comprising plasmid pBR322 and BamHI fragments of HBV DNA, which can be
used to transform E. coli. Another vector, comprising a BamHI fragment of HBV DNA and a portion of the
tryptophan operon, was used to obtain expression in E. coli strain HB101.

Edman et al. (Nature, 291, Number 5815, 503-506, 1981) describe construction of plasmids which direct
synthesis of HBcAg and a β-lactamase-HBsAg fusion protein, under the control of the tryptophan operon
regulatory region, in E. coli.

Pumpen et al. (Gene, 30, 201-210, 1984) disclose that small amounts of HBsAg monomer proteins and
fusion proteins are synthesized in E. coli, using antibodies raised against the denatured HBsAg monomer for
detection.

Other references disclosing insertion of HBV DNA into bacteria include Charnay et al. (Prog. Med. Virol., 27,
88-92, 1981); MacKay et al. (Proc. Natl. Acad. Sci. U.S., 78, Number 7, 4510-4514, 1981); Fritsch et al. (C. R.
Acad. Sci., 287, Number 16, 1453, 1978); U.K. Patent Specification 2,034,323 (Derwent No. 48874C) and Pasek
et al. (Nature, 282, No. 6, 575, 1979).

HBV DNA has also been cloned in mammalian cells. These include human, mouse, and human hepatoma
cell lines. For example, Dubois et al. (Proc. Natl. Acad. Sci. U.S., 77, Number 8, 4549-4553, 1980) report
transformation of mouse cells with a plasmid containing the HBV genome and expression of HBsAg;
Hirschman et al. (Proc. Natl. Acad. Sci. U.S., 77, Number 9, 5507-5511, 1980) report production of HBV-like
particles by HeLa cells transformed with HBV DNA.

Procedures for preparing HBV vaccine using HBsAg from human blood are reported by Funakoshi et al.
(Prog. Med. Virol., 27, 163-187, 1981) and Maupas et al. (Prog. Med. Virol., 27, 185-201, 1981). The vaccine
prepared by Funakoshi et al. contains 40 µg of purified, formalin-treated HBsAg, phosphate, sodium chloride,
20 mg of mannitol, and 0.1% of aluminum hydroxide as adjuvant. In the latter paper, Maupas et al. report that
one dose of vaccine was 1 mL of purified, formalin-treated HBsAg containing 2-10 µg/mL of protein (Lowry's
method) and 0.1% of aluminum hydroxide. The protocol used in the study reported by Maupas et al. called for
three injections at one month intervals with a booster after one year; the authors propose a protocol
consisting of two injections of concentrated HBsAg at three month intervals.

Additional references to preparation of HBV vaccines include Maupas et al., Adamowicz et al., and
Funakoshi et al. at pages 3, 37 and 57, respectively, of Hepatitis B Vaccine INSERM Symposium No. 18, edited by
Maupas and Guesry, 1981, Elsevier/North-Holland Biomedical Press.

Yeasts have been used as host organisms for expression of certain other non-HBV DNA sequences. For
example, Fraser et al., U.K. Patent Application 2,068,969, disclose preparation of chicken ovalbumin in yeast;
Scrip No. 640, p. 11 (Nov. 4, 1981) contains a report that a type of interferon is being prepared in yeast. In
European Patent 11,562 (Derwent No. 38762C) are reported hybrid yeast plasmids containing the ura+ yeast
gene in the 2 µ plasmid.

The authentic Hepatitis B Virus surface antigen (HBsAg) can be recovered from plasma of infected
individuals as a particle of about 22 nm comprised of two proteins known as P24 and its glycosylated
derivative GP28, both of which are encoded by the 226 amino acid coding sequence in the HBV genome
known as the S protein coding sequence. The entire 163 amino acid coding sequence which immediately
precedes the S protein coding sequence on the HBV genome is referred to herein as the Pre S coding
sequence. The 55 amino acids of the Pre S coding sequence which immediately precede the S protein coding
sequence are referred to herein as the Pre S2 coding sequence, and the remaining 108 amino acids of the Pre
S coding sequence are referred to herein as the Pre S1 coding sequence. The Pre S coding sequence, or any
smaller portion thereof, is also referred to herein as the HBsAg precursor protein.

For completeness, it should be noted that the entire Pre S coding sequence for some HBV subtypes (e.g.,
ayw) is 163 codons, while the entire Pre S coding sequence for other HBV subtypes (e.g., adw2) is 174 codons.
In either case, the Pre S2 region is the 55 codons immediately preceding the S protein coding sequence.
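
The region lengths quoted above follow from simple subtraction, and the Pre S1 length can be cross-checked against the Pre S1 coding region listed in the Summary of the Invention. The following Python sketch is illustrative only and is not part of the disclosed method: the names `PRE_S_CODONS`, `pre_s1_codons` and `translate` are hypothetical, the DNA string is copied from that listing, and the standard genetic code is assumed.

```python
from itertools import product

# Lengths in codons, as stated in the text above.
S_CODONS = 226
PRE_S2_CODONS = 55
PRE_S_CODONS = {"ayw": 163, "adw2": 174}


def pre_s1_codons(subtype: str) -> int:
    """Pre S1 is what remains of Pre S once the 55 Pre S2 codons are removed."""
    return PRE_S_CODONS[subtype] - PRE_S2_CODONS


# Standard genetic code (an assumption), enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Pre S1 coding region, copied from the listing in the Summary of the Invention.
PRE_S1_DNA = (
    "ATG GGG ACG AAT CTT TCT GTT CCC AAC CCT CTG "
    "GGA TTC TTT CCC GAT CAT CAG TTG GAC CCT "
    "GCA TTC GGA GCC AAC TCA AAC AAT CCA GAT "
    "TGG GAC TTC AAC CCC ATC AAG GAC CAC TGG "
    "CCA GCA GCC AAC CAG GTA GGA GTG GGA GCA "
    "TTC GGG CCA GGG CTC ACC CCT CCA CAC GGC "
    "GGT ATT TTG GGG TGG AGC CCT CAG GCT CAG "
    "GGC ATA TTG ACC ACA GTG TCA ACA ATT CCT "
    "CCT CCT GCC TCC ACC AAT CGG CAG TCA GGA "
    "AGG CAG CCT ACT CCC ATC TCT CCA CCT CTA "
    "AGA GAC AGT CAT CCT CAG GCC"
)


def translate(dna: str) -> str:
    """Translate a DNA string (whitespace ignored) into one-letter amino acids."""
    dna = "".join(dna.split())
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


PRE_S1_PROTEIN = translate(PRE_S1_DNA)

if __name__ == "__main__":
    for subtype in PRE_S_CODONS:
        print(subtype, "Pre S1 codons:", pre_s1_codons(subtype))
    print("Pre S + S codons (ayw):", PRE_S_CODONS["ayw"] + S_CODONS)
    print("Listed Pre S1 length:", len(PRE_S1_PROTEIN))
```

Under these assumptions the 163-codon Pre S region leaves 108 codons for Pre S1, the 174-codon region leaves 119, and the listed Pre S1 coding region translates to 108 residues.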

The Pre S2 coding sequence codes for the polyalbumin binding or receptor site found on the surface of
Dane particles and on the surface of some HBsAg particles isolated from sera of HBV-infected patients.
Although the Pre S2 region immediately precedes the S protein coding region on the HBV genome, the Pre S2
region is not involved in the assembly of HBsAg particles. See, e.g., Persing et al. (Proc. Natl. Acad. Sci. U.S.A.,
82, 3440-3444, 1985).

Valenzuela et al. (Nature, 298, 347-350, 1982) disclose that particles resembling HBsAg were found after
disruption of yeast cells transformed with a vector containing the S protein coding sequence, and concluded
that HBsAg particles are synthesized in yeast.

Harford et al. (Developments in Biological Standardization, 54, 125-139, 1983) disclose that particles
resembling HBsAg were found after disruption of yeast cells transformed with a vector containing 18 amino
acids of the coding sequence of ornithine carbamoyl transferase immediately preceding the N-terminal 42
amino acids of the Pre S2 coding sequence immediately preceding the S protein coding sequence.

Laub et al. (J. Virology, 48(1), 271-280, 1983) disclose construction of a Simian Virus 40 early replacement
vector that has a large portion of HBV genome DNA including the Pre S-S protein coding sequence, and
expression of such portion by SV40-transformed CV-1 cells (COS cells) transformed by such vector.

Stibbe et al. (J. Virology, 46(2), 626-628, 1983) disclose that minor HBsAg glycoproteins, named GP33 and
GP36, are coded for by the Pre S2-S protein coding sequence. Stibbe et al. (Dev. Biol. Stand., 54, 33-43, 1983)
disclose that HBsAg particles containing large amounts of GP33 and GP36 did not induce higher anti-S protein
antibody titers than those HBsAg particles almost devoid of GP33 and GP36.

Heerman et al. (J. Virol., 52(2), 396-402, 1984) disclose that the entire 389 amino acid coding sequence
comprising both the S protein coding sequence and the Pre S coding sequence codes for a polypeptide, P39,
which is found in HBV particles and viral surface antigen filaments along with its glycosylated form, GP42, and
the other HBV surface antigen associated polypeptides P24, GP27, GP33 and GP36.

Neurath et al. (Science, 224, 392-394, 1984) disclose that polypeptides having the sequence of the 26
amino-terminal amino acids of the Pre S2 coding sequence act as an immunogen, and suggest that antibodies
elicited by the immunogen can be utilized for diagnostic tests.

Michel et al. (Proc. Natl. Acad. Sci. U.S.A., 81, 7708-7712, 1984) disclose synthesis of HBsAg carrying human
serum polyalbumin receptors in Chinese hamster ovary (CHO) cells transfected with a plasmid carrying the
Pre S2-S protein coding sequence.

Persing et al. (Proc. Natl. Acad. Sci. U.S.A., 82, 3440-3444, 1985) disclose that mouse L cells transformed
with a Pre S2-S protein coding sequence produce three HBsAg related polypeptides (24,000, 27,000 and
35,000 daltons), all of which may be organized into complex immunoreactive HBsAg particles of 22 nm
diameter which bind to polymerized human serum albumin (HSA); while mouse L cells transformed with the
Pre S2-S protein coding sequence bearing a frame-shift mutation near the 3' end of the Pre S2 region produce
only the 24,000 and 27,000 dalton polypeptides organized into 22 nm diameter immunoreactive HBsAg
particles which are unable to bind HSA. Persing et al. conclude that the Pre S2-S protein coding sequence
encodes the 35,000 dalton species; that the Pre S2 protein accounts for the HSA binding activity of HBsAg but
is not required for assembly and secretion of the HBsAg particles; and that the major polypeptide of HBsAg
(i.e., the 24,000 dalton species) is not derived primarily by cleavage of larger precursors (i.e., the 27,000 and
35,000 dalton species) encoded by the Pre S2-S protein coding sequence.

Milich et al. (Science, 228, 1195-1199, 1985) disclose that vaccines that contain HBV particles with both Pre
S2 and HBsAg, prepared from Chinese Hamster Ovary (CHO) cells transfected with a plasmid containing the
Pre S2-S protein coding sequence, can circumvent nonresponsiveness in certain mice to vaccines which
contain only HBsAg; and that the 26 amino acid residues at the NH2-terminus of the 33,000 dalton polypeptide
coded for by the Pre S2-S protein coding sequence represent a dominant antibody binding site on the Pre S2
region.

Michel et al., "Vaccines 86", Brown et al., Ed., Cold Spring Harbor Laboratory (1986), 356-363, discuss the
synthesis in CHO cells of Hepatitis B surface antigen particles containing the Pre S2 region expression
product.

Neurath et al. (Nature, 315, 154-156, 1985) disclose that the Pre S region codes for proteins on the HBV
envelope with domains specifically recognized by liver cells; that the Pre S2-S protein coding sequence codes
for a protein present in HBV particles; and that synthetic peptides corresponding to the gene encoding Pre S
proteins are immunogenic; and conclude that HBV vaccines should contain Pre S determinants.

Valenzuela et al. (Biotechnology, 3, 317-320, 1985) disclose the expression of the entire Pre S2-S protein
coding sequence in yeast transformed with such coding sequence; and also disclose that such yeast cells do
synthesize a particle containing both HBsAg and Pre S2 which is very similar in electron microscopy and
sedimentation properties to particles containing only HBsAg, and that the Pre S2 region does not interfere with
the ability to form particles resembling the 22 nm HBsAg particles. Valenzuela et al. also disclose that it has been
hypothesized that HBV gets into the liver by binding polyalbumin to its polyalbumin receptor, which in turn
binds the polyalbumin receptor to the liver cell so that the virus gets internalized, and that the polyalbumin
receptor of the HBV is encoded by the Pre S2 region. Valenzuela concludes that a vaccine containing Pre S2
will elicit antibodies which, in addition to inactivating HBV through the normal mechanism, might interfere with
the way the virus enters liver cells.

Takeda Chemical Ind. KK, European Patent Application Publication No. 171,908-A, claims the production of
non-glycosylated hepatitis-B virus surface antigen P31 protein in yeast.

Chiron Corporation, European Patent Application Publication No. 0,174,444 A2, claims a method for
preparing HBsAg particles containing P31 protein in yeast.

The following Table I compares known Pre S2 region amino acid sequences with the HBV Pre S2 amino acid
sequence of the subject invention:
Table I. Comparison of amino acid sequence of Pre S2 region from different serotypes of HBV

HBV
subtype  Pre S2 coding region (positions -55 to 0)                  Reference

adw      MQWNSTAFHQALQDPRVRGLYFPAGGSSSGTVNPAPNIASHISSSSARTGDPVTN    The subject Pre S2 coding region
adw      T L I                                                      1
adw      L L I I                                                    2
adw      I                                                          3
adw      K I                                                        4
adr      T L V TT P IFS AP                                          5
adr      T L V TT P I S AP                                          6
ayw      T T VLTT PL IFS I AL                                       7
adyw     H T T V TH P IFS I AL                                      8

Amino Acid Abbreviation Key

Asp  D  Aspartic acid        Ile  I  Isoleucine
Thr  T  Threonine            Leu  L  Leucine
Ser  S  Serine               Tyr  Y  Tyrosine
Glu  E  Glutamic acid        Phe  F  Phenylalanine
Pro  P  Proline              His  H  Histidine
Gly  G  Glycine              Lys  K  Lysine
Ala  A  Alanine              Arg  R  Arginine
Cys  C  Cysteine             Trp  W  Tryptophan
Val  V  Valine               Gln  Q  Glutamine
Met  M  Methionine           Asn  N  Asparagine
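
The abbreviation key can be applied mechanically to move between the three-letter and one-letter notations used in this specification. The sketch below is illustrative only: the names `THREE_TO_ONE`, `SUBJECT_PRE_S2` and `to_one_letter` are hypothetical, and the three-letter string is copied from the Pre S2 amino acid sequence given in the Summary of the Invention.

```python
# One-letter codes taken from the Amino Acid Abbreviation Key.
THREE_TO_ONE = {
    "Asp": "D", "Ile": "I", "Thr": "T", "Leu": "L", "Ser": "S",
    "Tyr": "Y", "Glu": "E", "Phe": "F", "Pro": "P", "His": "H",
    "Gly": "G", "Lys": "K", "Ala": "A", "Arg": "R", "Cys": "C",
    "Trp": "W", "Val": "V", "Gln": "Q", "Met": "M", "Asn": "N",
}

# Pre S2 sequence of the subject invention, copied from the Summary of the Invention.
SUBJECT_PRE_S2 = (
    "Met-Gln-Trp-Asn-Ser-Thr-Ala-Phe-His-Gln-Ala-Leu-Gln-Asp-"
    "Pro-Arg-Val-Arg-Gly-Leu-Tyr-Phe-Pro-Ala-Gly-Gly-Ser-Ser-Ser-"
    "Gly-Thr-Val-Asn-Pro-Ala-Pro-Asn-Ile-Ala-Ser-His-Ile-Ser-"
    "Ser-Ser-Ser-Ala-Arg-Thr-Gly-Asp-Pro-Val-Thr-Asn"
)


def to_one_letter(sequence: str) -> str:
    """Convert a hyphen-separated three-letter sequence to one-letter code."""
    return "".join(THREE_TO_ONE[residue] for residue in sequence.split("-"))


ONE_LETTER = to_one_letter(SUBJECT_PRE_S2)

if __name__ == "__main__":
    print(ONE_LETTER)
    print("residues:", len(ONE_LETTER))
```

The converted string has 55 residues, matching the 55-codon length stated for the Pre S2 region, and corresponds to the first adw row of Table I.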






0278 940 



Reference Key

1. Valenzuela et al., in Animal Virus Genetics (Field, Jaenisch and Fox eds.), p. 57-70, Academic Press,
New York (1980).
2. Chow et al., Fourth Annual Congress for Recombinant DNA Research, San Diego, USA, 1984.
3. Fujisawa et al., Nucl. Acids Res., 11, 4601-4610 (1983).
4. Lo et al., Biochem. Biophys. Res. Com., 2, 382-388 (1986).
5. Fujisawa et al., cited above.
6. Ono et al., Nucl. Acids Res., 11, 1747-1757 (1983).
7. Galibert et al., Nature, 281, 646-650 (1979).
8. Pasek et al., Nature, 282, 575-579 (1979).

B. Malaria Vaccines 

Numerous recent studies suggest that protective immunity in a sensitive host to infection by the sporozoite
stage of a particular species of Plasmodium can be mediated via antibodies produced by such host to the
circumsporozoite protein (CS protein) of the particular sporozoite.

The sporozoite proteins of some types of Plasmodium have been cloned and some of those have been
expressed by recombinant DNA techniques.

Nussenzweig et al. (U.S. Patent 4,466,917) disclose a sporozoite polypeptide, identified as the P-44 protein,
and its cloning and expression in E. coli.

Sharma et al. (Science, 228, 879-882, 1985) disclose the expression of a portion of the circumsporozoite
(CS) protein of Plasmodium knowlesi in yeast by employing an expression vector containing the 5' regulatory
region of the yeast alcohol dehydrogenase I gene in place of the 5' upstream region of the P. knowlesi CS gene
sequence.

New York University, PCT application Publication Number WO84-02922-A, disclose cloning of a portion of
the coding region for the P. knowlesi circumsporozoite (CS) protein repeat unit and expression of
beta-lactamase and beta-galactosidase fusions thereof in E. coli.

Kemp et al., PCT Patent Application Publication Number WO84-02917-A, disclose cloning and expression in
E. coli of a coding sequence for the CS protein derived from the blood stage of P. falciparum.

Dame et al. (Science, 225, 593, 1984) report cloning and expression of the CS protein of P. falciparum in E.
coli. The protein is described as comprising about 412 amino acids with an approximate molecular weight of
44,000. It comprises 41 tandem repeats of a tetrapeptide. Synthetic 7-, 11- and 15-residue peptides derived
from the repeat region bound to monoclonal antibodies raised against the CS protein.
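
The figures reported by Dame et al. imply how much of the CS protein is taken up by the repeat region. The short Python sketch below is only a back-of-the-envelope check using the numbers quoted above; the variable names are hypothetical and the derived values are not figures reported by the authors.

```python
# Figures quoted in the text above.
CS_PROTEIN_RESIDUES = 412    # "about 412 amino acids"
CS_PROTEIN_MASS_DA = 44_000  # "approximate molecular weight of 44,000"
TANDEM_REPEATS = 41
REPEAT_LENGTH = 4            # each repeat is a tetrapeptide

# Residues contained in the tandem repeat region.
repeat_residues = TANDEM_REPEATS * REPEAT_LENGTH

# Fraction of the whole protein occupied by the repeats.
repeat_fraction = repeat_residues / CS_PROTEIN_RESIDUES

# Average mass per residue implied by the quoted size and weight.
mean_residue_mass = CS_PROTEIN_MASS_DA / CS_PROTEIN_RESIDUES

if __name__ == "__main__":
    print(f"repeat residues: {repeat_residues}")
    print(f"repeat fraction: {repeat_fraction:.1%}")
    print(f"mean residue mass: {mean_residue_mass:.1f} Da")
```

On these numbers the repeats account for 164 residues, roughly two fifths of the protein, so the 7-, 11- and 15-residue synthetic peptides each span more than one tetrapeptide repeat.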

Ellis et al. (Nature, 302, 536-538, 1983) report expression of a beta-lactamase - P. knowlesi CS protein fusion
in E. coli.

Enea et al. (Proc. Natl. Acad. Sci. USA, 81, 7520-7524, 1984) report an analogous repeat unit structure
within the CS protein of P. cynomolgi, and the cloning and expression of the CS protein in E. coli as well as a
disclosure of its coding sequence.

Ballou et al. (Science, 228, 996-999, 1985) disclose that synthetic peptides of the repeating region of the CS
protein of Plasmodium falciparum conjugated to bovine serum albumin (BSA) or to thyroglobulin produced
antibody responses in mice and rabbits, and conclude that a vaccine against the sporozoite stage of the
malaria parasite can be developed by using synthetic peptides of the repeating region of the CS protein
conjugated to a carrier protein.

Weber et al. (Molecular and Biochemical Parasitology, 15, 305-316, 1985) disclose the use of a cloned CS
protein gene of a Brazilian strain of P. falciparum as a probe to analyze the structure of 17 other P. falciparum
strains by nucleic acid hybridization, and conclude that the CS protein gene is highly conserved and that
malaria vaccine development with the CS protein is unlikely to be complicated by strain variation.

Arnot et al. (Science, 230, 815-818, 1985) disclose the cloning of the CS protein of P. vivax and disclose its
entire coding sequence, and conclude that the coding sequence of the CS protein of P. vivax is homologous to
that of the CS gene of P. knowlesi but not P. falciparum.

Sharma et al. (Science, 229, 779-782, 1985) disclose the complete nucleotide sequence of the coding region
of the CS gene of the Nuri strain of P. knowlesi.

Young et al. (Science, 228, 958-962, 1985) disclose the expression of P. falciparum CS proteins in E. coli for
potential use in a human malaria vaccine.

The Walter and Eliza Hall Institute of Medical Research, PCT Patent Application Publication No.
WO84/02917, 1984, disclose artificially constructed polynucleotide sequences substantially corresponding to
all or a portion of P. falciparum mRNA or genomic DNA; the peptide corresponding to such sequence and a
method for production thereof; and a composition for stimulating immune responses against P. falciparum
antigens in a mammal comprising such peptide.

Mazier et al. (Science, 231, 156-159, 1986) disclose that antibodies raised in mice immunized with several
recombinant and synthetic peptides of the circumsporozoite protein of P. falciparum were evaluated for
protective activity in a human hepatocyte culture system and were found to exhibit a protective effect against
the parasite at three points: sporozoite attachment to the hepatocyte surface, entry and subsequent
intracellular development.

C. Polyvalent vaccines

Valenzuela et al. (Biotechnology, 3, 323-327, 1985) disclose the use of the HBsAg polyalbumin receptor
coded for by the Pre S2 coding sequence as a means to prepare polyvalent vaccines. Valenzuela et al. also
disclose preparation of a hybrid HBsAg-Herpes simplex 1 virus glycoprotein D (HSV-1gD) particle by
expressing a hybrid HSV-1gD-Pre S2-S protein coding sequence in yeast.

Valenzuela, "Hepatitis B subunit vaccines using recombinant DNA Techniques," May 15, 1985,
Bio-Expo-5 Meeting, Boston, Massachusetts, discloses that with a hybrid such as the HBsAg-HSV-1gD
particle described above, there is a problem of immunodominance of the HBsAg over the foreign immunogen
presented on its surface.

Chiron Corporation, European Patent Application Publication No. 0,175,261, claims a novel hybrid particle
containing at least a major portion of the Hepatitis B surface antigen fused to one or more oligopeptides,
where the hybrid polypeptide is capable of forming a particle in a cellular host with presentation of at least one
epitope of at least one oligopeptide.

Proteins can undergo post-translational modifications and some of these can profoundly affect the protein's
conformation and function. The presence of oligosaccharide chains on human derived preS1-preS2-S and
preS2-S protein (Heerman et al. in J. Virol., 52, 396-402, 1984; Stibbe and Gerlich in Virology, 123, 436-442,
1982; Stibbe and Gerlich in J. Virol., 46, 626-628, 1983) and presumably of myristic acid on the preS1-preS2-S
protein (Persing et al., Abstracts of papers presented at the 1986 meeting on Molecular Biology of Hepatitis B
viruses, August 28-31, 1986, Cold Spring Harbor Laboratory) might influence particle formation, conformation
of the proteins in the particle and the antigenicity and immunogenicity of these particles.

Yeast (Saccharomyces cerevisiae) has the enzymatic capability for protein glycosylation (Ballou, "The
Molecular Biology of the Yeast Saccharomyces, Metabolism and Gene Expression"; Strathern, Jones and
Broach Ed., Cold Spring Harbor Laboratory (1982), p. 335-380) and fatty acid acylation (Towler and Glaser in
Proc. Natl. Acad. Sci., 83, 2812-2816, 1986) similar to those of higher eukaryotic cells.

Viral surface glycoproteins expressed in yeast have been found to be glycosylated. Jabbar et al. (Proc. Natl.
Acad. Sci., 82, 2019-2023, 1985) disclose that expression of the influenza viral hemagglutinin gene in yeast
resulted in a glycosylated hemagglutinin protein. Wen and Schlesinger (Proc. Natl. Acad. Sci., 83, 3639-3643,
1986) disclose that expression of Sindbis and Vesicular stomatitis virus glycoproteins in yeast gave
glycosylated forms of these viral proteins. Fujisawa et al. (Abstracts of papers presented at the 1986 meeting
on Molecular Biology of Hepatitis B viruses, August 28-31, 1986, Cold Spring Harbor Laboratory, page 62)
disclose that expression of the preS2-S gene in yeast results in the synthesis of two glycosylated forms of the
preS2-S protein. Kniskern et al. (Abstracts of papers presented at the 1986 meeting on Modern Approaches to
New Vaccines, Including Prevention of AIDS, September 9-14, 1986, Cold Spring Harbor Laboratory, page 89)
disclose that expression of the preS1-preS2-S gene in yeast results in the synthesis of two preS1-preS2-S
proteins of 45 kD and 39 kD. Persing et al. (Abstracts of papers presented at the 1986 meeting on Molecular
Biology of Hepatitis B viruses, August 28-31, 1986, Cold Spring Harbor Laboratory, page 19) disclose that
labeling of COS7 cells expressing preS1-preS2-S, preS2-S and S proteins with 3H-myristic acid results in the
presence of label in the preS1-preS2-S protein.

Fusion proteins expressed in yeast and reassembled in a particle are purified by the general scheme:
Aerosil adsorption and desorption (1) → CaCl2 precipitation (2) → phenylagarose or phenylboronate-agarose
adsorption and desorption (3) → CsCl gradient centrifugation or DEAE ion exchange chromatography.

Step 1 is described in EP application publication 0 204 680 and step 3 is described in EP application
publication 0 199 698 as far as phenylagarose is concerned. The specific combination of steps (1) and (3) is able
to perform a drastic clean-up of contaminating material (elimination of 90% protein, carbohydrates and 50%
lipids). This novel combination allows the good functioning of the next chromatographic steps.

Where the fusion proteins are glycosylated, phenylboronate (PBA) is used as an affinity adsorbent. At pH
9.0 the boronate group makes covalent complexes with potential cis-diol groups on the glycosylated
side chains. PBA gels have been used occasionally for the purification of soluble glycoproteins but not for the
purification of particles containing glycoproteins embedded in the lipid bilayer (see Cook et al., Analytical
Biochemistry, 149, 34^353, 1985; Middle et al., Biochemical Journal, 209, 771-779, 1983; Cook et al.,
Biochimica et Biophysica Acta, 828, 205-212, 1985; Maestas et al., Journal of Chromatography, 189,
225-231, 1980). Because of the multiple ligand binding sites on the particle, PBA gels with very low PB-ligand
density have to be used, e.g., PBA gels with boronate content < 10 µg/ml gel, preferably 5 µg boronate/ml gel.
The use of phenylboronate gels with very low boronate content allows affinity chromatography of particles
containing embedded glycoproteins.

SUMMARY OF THE INVENTION 

This invention relates to a recombinant DNA molecule comprising a functional DNA coding sequence fused,
in phase, to a portion of the Pre S2 region of a Hepatitis B virus (HBV) Pre S2-S protein coding sequence. The
term "coding sequence" or "coding region" as used herein also encompasses any functional derivative
thereof. By the term "functional derivative" is meant a coding sequence with amino acid alterations which,
where appropriate, do not interfere with particle formation and/or which retain immunogenicity where such
retention is desirable. Such functional derivative can be prepared by conventional site specific mutagenesis
such as by the methods disclosed by Botstein et al. (Science, 229, 1193-1201, 1985). Preferably such DNA
molecule also comprises a regulatory region which is preferably derived from the yeast arg3 gene.

This invention also relates to a recombinant vector which comprises a recombinant DNA molecule
operatively linked to a regulatory region, wherein the DNA molecule contains a functional DNA coding
sequence fused, in phase, to a portion of the Pre S2 region of a HBV Pre S2-S protein coding sequence. By the
term "regulatory region" as used herein is meant a DNA sequence which contains a promoter region and other
sequences necessary for transcription and translation of a coding sequence.

This invention also relates to a eukaryotic host cell transformed with a recombinant vector, wherein said
vector comprises a DNA molecule operatively linked to a regulatory region, and wherein said DNA molecule
contains a functional DNA coding sequence fused, in phase, to a portion of the Pre S2 region of a HBV Pre
S2-S protein coding sequence. This invention also relates to a method of preparing such transformed host cell
which comprises transforming a eukaryotic host cell with such recombinant vector.

This invention also relates to a method for preparing a hybrid particle containing HBsAg protein which
contains and/or presents a peptide encoded by a functional DNA coding sequence, which comprises culturing
a eukaryotic host cell transformed with a recombinant vector in appropriate culture media and isolating the
particle from a cell lysate or extract of such host's culture, wherein said vector comprises a DNA molecule
operatively linked to a regulatory region, wherein the DNA molecule contains the functional DNA coding
sequence fused, in phase, to a portion of the Pre S2 region of a HBV Pre S2-S protein coding sequence.

This invention also relates to a hybrid immunogenic particle comprising a Hepatitis B virus surface antigen
(HBsAg) particle which contains and/or presents a peptide encoded by a functional DNA coding sequence.
This invention also relates to a vaccine comprising an immunoprotective amount of such hybrid particles.
This invention also relates to a micelle comprising such hybrid particle and polysorbate and to a vaccine
comprising an immunoprotective amount of such micelles.

By the term "functional DNA coding sequence" is meant a DNA coding sequence which, when fused, in
phase, to a portion of the Pre S2 region of a HBV Pre S2-S protein coding sequence, is not involved with, and
does not interfere with, the assembly of the HBsAg particle.

Preferred functional DNA coding sequences include, but are not limited to, (a) the entire HBV Pre S protein
coding sequence, or an immunogenic derivative thereof, such as, but not limited to, the Pre S2, Pre S1 or Pre
S1-Pre S2 protein coding sequence; and (b) other immunogenic coding sequences such as the coding
sequence of the circumsporozoite protein of Plasmodium, or any immunogenic derivative thereof, the
envelope gene coding sequence of HIV or any immunogenic derivative thereof, especially the Ci peptide,
peptide 121 or Dreesman peptide coding sequence or an immunogenic derivative thereof.

This invention also relates to a HBV Pre S1 protein coding region which codes for the following amino acid 
sequence: 



-163

ATG GGG ACG AAT CTT TCT GTT CCC AAC CCT CTG
Met Gly Thr Asn Leu Ser Val Pro Asn Pro Leu

GGA TTC TTT CCC GAT CAT CAG TTG GAC CCT
Gly Phe Phe Pro Asp His Gln Leu Asp Pro

GCA TTC GGA GCC AAC TCA AAC AAT CCA GAT
Ala Phe Gly Ala Asn Ser Asn Asn Pro Asp

TGG GAC TTC AAC CCC ATC AAG GAC CAC TGG
Trp Asp Phe Asn Pro Ile Lys Asp His Trp

CCA GCA GCC AAC CAG GTA GGA GTG GGA GCA
Pro Ala Ala Asn Gln Val Gly Val Gly Ala

TTC GGG CCA GGG CTC ACC CCT CCA CAC GGC
Phe Gly Pro Gly Leu Thr Pro Pro His Gly

GGT ATT TTG GGG TGG AGC CCT CAG GCT CAG
Gly Ile Leu Gly Trp Ser Pro Gln Ala Gln

GGC ATA TTG ACC ACA GTG TCA ACA ATT CCT
Gly Ile Leu Thr Thr Val Ser Thr Ile Pro

CCT CCT GCC TCC ACC AAT CGG CAG TCA GGA
Pro Pro Ala Ser Thr Asn Arg Gln Ser Gly

AGG CAG CCT ACT CCC ATC TCT CCA CCT CTA
Arg Gln Pro Thr Pro Ile Ser Pro Pro Leu

AGA GAC AGT CAT CCT CAG GCC
Arg Asp Ser His Pro Gln Ala



or to a HBV Pre S2 protein coding region or to a HBV Pre S2-S protein coding region wherein the Pre S2 
portion thereof codes for the following amino acid sequence: 

-55                 -50
Met-Gln-Trp-Asn-Ser-Thr-Ala-Phe-His-Gln-Ala-Leu-Gln-Asp-

    -40                                     -30
Pro-Arg-Val-Arg-Gly-Leu-Tyr-Phe-Pro-Ala-Gly-Gly-Ser-Ser-

                            -20
Ser-Gly-Thr-Val-Asn-Pro-Ala-Pro-Asn-Ile-Ala-Ser-His-Ile-

            -10
Ser-Ser-Ser-Ser-Ala-Arg-Thr-Gly-Asp-Pro-Val-Thr-Asn;

a recombinant DNA vector comprising such Pre S1, Pre S2 or Pre S2-S protein coding region operatively
linked to a regulatory region; a transformed host cell containing such vector; a process for preparing such
transformed cell which comprises transforming a host cell with such vector; the protein coded for by such
coding region; a process for preparing such protein; a vaccine comprising an immunoprotective amount of
such protein; the particle coded for by such Pre S2-S protein coding region; a process for preparing such
particle; a vaccine comprising an immunoprotective amount of such particle; a micelle comprising such
particle and polysorbate; and a vaccine comprising an immunoprotective amount of such micelles.

This invention also relates to a recombinant DNA vector comprising an entire Pre S1-Pre S2-S protein
coding sequence or Pre S1-functional DNA coding sequence-Pre S2-S protein coding sequence, or an
immunogenic derivative thereof, operatively linked to a regulatory region; to a yeast host cell transformed with
such vector; to a method of preparing such transformed host which comprises transforming a yeast host cell
with such vector; to a method of preparing protein encoded by the Pre S1-Pre S2-S protein coding sequence,
or Pre S1-functional DNA coding sequence-Pre S2-S protein coding sequence, or an immunogenic derivative
thereof; to the protein prepared by such method; to a vaccine comprising an immunoprotective amount of
such protein; to a particle containing polypeptide encoded by the Pre S1-Pre S2-S protein coding sequence or
Pre S1-functional DNA coding sequence-Pre S2-S protein coding sequence or an immunogenic derivative
thereof; to a method of preparation thereof; to a vaccine comprising an immunoprotective amount of such
particles; to a micelle comprising such particle and polysorbate; and to a vaccine comprising an
immunoprotective amount of such micelles.

This invention also relates to a Pre S1-Pre S2 protein coding sequence or Pre S1-Pre S2-S protein coding 
sequence of which the Pre S1-Pre S2 portion comprises the following amino acid sequence: 

PRE-S1 REGION

-163

ATG GGG ACG AAT CTT TCT GTT CCC AAC CCT CTG
Met Gly Thr Asn Leu Ser Val Pro Asn Pro Leu

GGA TTC TTT CCC GAT CAT CAG TTG GAC CCT
Gly Phe Phe Pro Asp His Gln Leu Asp Pro

GCA TTC GGA GCC AAC TCA AAC AAT CCA GAT
Ala Phe Gly Ala Asn Ser Asn Asn Pro Asp

TGG GAC TTC AAC CCC ATC AAG GAC CAC TGG
Trp Asp Phe Asn Pro Ile Lys Asp His Trp

CCA GCA GCC AAC CAG GTA GGA GTG GGA GCA
Pro Ala Ala Asn Gln Val Gly Val Gly Ala

TTC GGG CCA GGG CTC ACC CCT CCA CAC GGC
Phe Gly Pro Gly Leu Thr Pro Pro His Gly

GGT ATT TTG GGG TGG AGC CCT CAG GCT CAG
Gly Ile Leu Gly Trp Ser Pro Gln Ala Gln

GGC ATA TTG ACC ACA GTG TCA ACA ATT CCT
Gly Ile Leu Thr Thr Val Ser Thr Ile Pro

CCT CCT GCC TCC ACC AAT CGG CAG TCA GGA
Pro Pro Ala Ser Thr Asn Arg Gln Ser Gly

AGG CAG CCT ACT CCC ATC TCT CCA CCT CTA
Arg Gln Pro Thr Pro Ile Ser Pro Pro Leu

AGA GAC AGT CAT CCT CAG GCC
Arg Asp Ser His Pro Gln Ala

PRE-S2 REGION

ATG CAG TGG AAT TCC ACT GCC TTC CAC CAA
Met Gln Trp Asn Ser Thr Ala Phe His Gln

GCT CTG CAG GAT CCC AGA GTC AGG GGT CTG
Ala Leu Gln Asp Pro Arg Val Arg Gly Leu

TAT TTT CCT GCT GGT GGC TCC AGT TCA GGA
Tyr Phe Pro Ala Gly Gly Ser Ser Ser Gly

ACA GTA AAC CCT GCT CCG AAT ATT GCC TCT
Thr Val Asn Pro Ala Pro Asn Ile Ala Ser

CAC ATA TCG TCA AGC TCC GCG AGG ACT GGG
His Ile Ser Ser Ser Ser Ala Arg Thr Gly

GAC CCT GTG ACG AAC
Asp Pro Val Thr Asn
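
The pairing of codons with three-letter residues in the listing above can be spot-checked mechanically. The following is a minimal sketch, assuming the standard genetic code; the names `CODON_TABLE`, `THREE_LETTER` and `translate` are illustrative and do not appear in the specification. It translates the first ten codons of the Pre S2 region.

```python
from itertools import product

# Standard genetic code with codons enumerated in T, C, A, G order
# (first base varies slowest); "*" marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

# One-letter to three-letter residue names, matching the abbreviation key
# used in this application.
THREE_LETTER = {
    "D": "Asp", "T": "Thr", "S": "Ser", "E": "Glu", "P": "Pro",
    "G": "Gly", "A": "Ala", "C": "Cys", "V": "Val", "M": "Met",
    "I": "Ile", "L": "Leu", "Y": "Tyr", "F": "Phe", "H": "His",
    "K": "Lys", "R": "Arg", "W": "Trp", "Q": "Gln", "N": "Asn",
    "*": "Stop",
}


def translate(spaced_codons):
    """Translate a whitespace-separated codon string into three-letter residues."""
    return [THREE_LETTER[CODON_TABLE[codon]] for codon in spaced_codons.split()]


# First ten codons of the Pre S2 region as listed in the text.
PRE_S2_START = "ATG CAG TGG AAT TCC ACT GCC TTC CAC CAA"
print("-".join(translate(PRE_S2_START)))
```

The same function can be applied row by row to the rest of the listing to confirm that each nucleotide line agrees with the residue line printed beneath it.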



This invention also relates to a recombinant vector comprising such novel Pre S1-Pre S2 or Pre S1-Pre S2-S
protein coding sequence operatively linked to an expression control sequence; a host cell transformed with
such vector; a method of preparing such transformed host which comprises transforming a host cell with such
vector; to a method of preparing protein encoded by such Pre S1-Pre S2 or Pre S1-Pre S2-S protein coding
sequence, or an immunogenic derivative thereof; to the protein prepared by such method; to a vaccine
comprising an immunoprotective amount of such protein; to a particle containing polypeptide encoded by
such Pre S1-Pre S2-S protein coding sequence or an immunogenic derivative thereof; to a method of
preparation thereof; a vaccine comprising an immunoprotective amount of such particles; and a micelle
comprising such particle and polysorbate.

The following represents the key to the amino acid abbreviations used in this application:
Asp Aspartic acid
Thr Threonine
Ser Serine
Glu Glutamic acid
Pro Proline
Gly Glycine
Ala Alanine
Cys Cysteine
Val Valine
Met Methionine
Ile Isoleucine
Leu Leucine
Tyr Tyrosine
Phe Phenylalanine
His Histidine
Lys Lysine
Arg Arginine
Trp Tryptophan
Gln Glutamine
Asn Asparagine



BRIEF DESCRIPTION OF THE FIGURES 

Figure 1 is a restriction endonuclease cleavage map of pRIT10601.

Figure 2 is a restriction endonuclease cleavage map of pRIT10616.

Figure 3 is a restriction endonuclease cleavage map of pMC200.

Figure 4 is a nucleotide sequence of a portion of the 3300 bp HindIII yeast DNA insert in pMC200,
which portion contains HincII, BglII and EcoRI sites.

Figure 5 is a flow sheet illustrating preparation of pRIT10671 and pRIT10673.

Figure 6 is a flow sheet illustrating preparation of pRIT10749.

Figure 7 is a flow sheet illustrating preparation of pRIT10761 and pRIT10764.

Figure 8 is a flow sheet illustrating preparation of pRIT10167 (Example 1); pRIT10158 and [...]
(Example 3); [...] pRIT10677 and [...] (Example 4); pRIT10911 (Example 5); [...] (Example 6);
pRIT12211, pRIT12209 and pRIT12230 (Example 7); and pRIT12288 and pRIT12322 (Example 8).

Figure 9 is a restriction endonuclease cleavage map of pRIT10172.

Figure 13 is a restriction endonuclease map of pRIT12309.

Figure 14 is a flow sheet illustrating preparation of pRIT12314 and pRIT12544.

Figure 15 is a flow sheet illustrating preparation of pRIT12658.

Figure 16 is a restriction endonuclease cleavage map of pRIT12662.

Figure 17 is a flow sheet illustrating preparation of pRIT12660.

Figure 18 is a flow sheet illustrating preparation of pRIT12845 (a, b, c, d, e, f).

Figure 1A is a flow sheet illustrating the preparation of pRIT12662.

Figure 2A shows a schematic restriction map and sequencing strategy of clone AMP"-

Figure 3A shows the nucleotide sequence of the circumsporozoite protein gene from P. falciparum.

Figure 4A shows the PreS1-PreS2 DNA coding sequence comprised by pRIT12792.

Figure 1B is a schematic restriction map of the HIV B virus envelope region.

Figure 2B illustrates the construction of plasmid pRIT12893.

Figure 3B illustrates the construction of plasmid pRIT12897.

Figure 4B illustrates the construction of plasmid pRIT12899.

Figure 5B illustrates the construction of plasmid pRITX.

DETAILED DESCRIPTION OF THE INVENTION 

By "regulatory region" is meant a DNA sequence which contains sequences necessary or
desirable for transcription and translation of a coding sequence. Such regulatory regions are generally placed
5' to the coding sequence which is to be expressed. Preferably a further sequence
containing a signal for the termination of transcription is placed immediately 3' to the coding region to be
expressed so as to effect termination of the transcript.

By "HBsAg" as used herein is meant an antigen containing proteins encoded by the S protein coding
sequence of the HBV genome, wherein such HBsAg is structurally similar to authentic HBsAg or has
substantially the same S protein coding sequence encoded antigenic determinants as authentic HBsAg, that
is, is capable of stimulating an immune response which specifically recognizes and reacts with authentic HBsAg
or is specifically recognized by anti-HBsAg antibodies.

The HBsAg coding sequence can be isolated from DNA extracted from Dane particles in infected human
serum by filling in the single strand region with a DNA polymerase, preferably [...],
followed by digestion with a restriction endonuclease. The choice of endonuclease will depend on the
particular Dane particles. For example, the HBsAg coding sequence of HBV DNA of certain Dane particles of
the adw serotype can be isolated on a single BamHI fragment; the HBsAg coding sequence of HBV DNA of
certain Dane particles of the ayw serotype can be isolated on a HhaI fragment. HBV DNA of Dane particles of
the same serotype may also exhibit different patterns of restriction sites.

Restriction of DNA to prepare DNA fragments used in the invention, ligation of such fragments to prepare
recombinant DNA molecules used in the invention and insertion into microorganisms are carried out by known
techniques such as techniques disclosed in the previously and subsequently cited references. Conditions are
selected to avoid denaturation of the DNA and enzymes. For example, the pH is buffered to remain at about 7.0
to 11.0 and the temperature is kept below about 60° C. Preferably restriction is carried out at a temperature of
about 30 to 40° C and ligation is carried out at about 0 to 10° C. Restriction enzymes and ligases used in
carrying out this invention are commercially available and should be used in accordance with instructions
included therewith. T4 DNA ligase is the preferred ligase.

The various fragments and final constructions may be joined together in accordance with conventional
ways. In many cases, genes have been isolated and restriction mapped, as well as sequenced. To that extent,
one can select the sequence of interest, such as the HBsAg sequence, by restriction of the gene, employing
further manipulation as necessary such as resection with Bal31, in vitro mutagenesis, primer repair, or the like,
to provide a fragment of a desired size, including the desired sequence, and having the appropriate termini.
Linkers and adapters can be used for joining sequences, as well as replacing lost sequences, where a
restriction site employed was internal to the region of interest. The various fragments which are isolated may
be purified by electrophoresis, electroeluted, ligated to other sequences, cloned, reisolated and further
manipulated.

The use of regulatory regions for controlling transcription of the structural gene of interest, such as the
HBsAg sequence, may allow for growing the host cells to high density with no or low levels of expression of the
structural gene, such as the HBsAg sequence, and then inducing expression by changing the environmental
conditions, e.g., nutrient, temperature, etc.

For example, with the GAL4 regulatory region, the yeast cells can be grown in rich media with a
glycerol-lactic acid combination to high density, e.g., mid or late log phase, followed by switching the carbon
source to galactose. As another example, for PHO5 regulation, one could grow the cells at a high phosphate
concentration of about 7 mM KH2PO4, and then recover and resuspend them in medium lacking phosphate to
effect derepression and expression. For temperature sensitivity, one could grow the cells at 25° to 37° C and
then change the temperature as appropriate by about 5° to 20° C. The host cells would have the regulatory
system associated with the regulatory region employed.

Fusing of the HBsAg sequence to the regulatory region can be accomplished by use of intermediate
vectors. Alternatively, the HBsAg sequence can be inserted directly into a vector which contains the regulatory
region. A vector is DNA which can carry and maintain the DNA fragment of the invention, including, for example,
phages and plasmids. Techniques for cloning DNA fragments in phages are disclosed, for example, by
Charnay et al., Nature, Volume 286, 893-895 (1980) and Tiollais, United Kingdom Patent Application 2,034,323.
Preferably, the HBsAg sequence is positioned relative to the regulatory region such that the HBsAg
synthesized by expression of the HBsAg sequence is devoid of extraneous amino acid residues.

In one embodiment of this invention the HBsAg coding sequence together with 42 codons of the HBsAg
precursor (preS-S) protein were fused in phase after the 18th amino acid of the coding sequence of the yeast
OCT protein which is transcribed from the arg 3 promoter. This construct gives a hybrid particle containing
HBsAg protein containing an additional 60 N-terminal amino acids to the 228 amino acids of the S-protein
proper.
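
The residue count in this embodiment follows from simple addition. As a minimal sketch, using only the figures stated in the paragraph above (the variable names are illustrative):

```python
# Figures taken from the embodiment described in the text.
OCT_RESIDUES = 18          # amino acids of the yeast OCT protein preceding the fusion point
PRE_S2_CODONS = 42         # codons of the HBsAg precursor (preS-S) protein retained
S_PROTEIN_RESIDUES = 228   # length of the S protein proper, as stated in the text

# N-terminal extension carried by the hybrid protein.
n_terminal_extension = OCT_RESIDUES + PRE_S2_CODONS

# Total length of the hybrid polypeptide under these figures.
hybrid_length = n_terminal_extension + S_PROTEIN_RESIDUES

print(n_terminal_extension, hybrid_length)
```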

An exemplary regulatory region is derived from the yeast arg 3 gene which codes for ornithine
carbamoyl-transferase (OCT). The arg 3 regulatory region is subject to both specific and general control
mechanisms. It has been cloned in E. coli on plasmid pY eura 3 arg 3 as reported by Crabeel et al., Proc. Natl.
Acad. Sci. U.S.A., Volume 78, 6026 (1981). Preferred hosts are S. cerevisiae strains in which the arginine
biosynthetic pathway is derepressed, such as strain Icl697d. Use of such strains results in increased
expression from the arg 3 promoter as compared to strain DC5. Examples of other useful regulatory regions
include those derived from yeast genes of the glycolytic pathway such as those coding for enolase, aldolase,
phosphoglycerate kinase and glyceraldehyde-3-phosphate dehydrogenase; the yeast alcohol dehydrogenase
gene; and, in general, genes involved in catabolism such as the arginase gene.

A preferred vector for cloning the fused DNA fragment into yeast is the plasmid YEp13, which is capable of
replication and maintenance in both E. coli and S. cerevisiae and is, therefore, known as a shuttle vector.
Several other yeast vectors are known and available. The HBsAg and regulatory regions can be inserted into a
yeast vector sequentially or simultaneously. Transformation with plasmid vectors containing autonomous
replication sequences (replicons) functional in yeast will normally result in extra-chromosomal maintenance of
the DNA molecule of the invention as a plasmid. Such replicons are well known in the art. Transformation with
plasmid vectors not containing replicons functional in yeast can result in incorporation of the DNA molecule
into chromosomal DNA.

Vaccines for stimulating protection against HBV infection in humans comprising HBsAg produced by yeast
in accordance with the invention and a suitable carrier can be prepared by known techniques. Use of an
adjuvant, such as aluminum hydroxide or aluminum phosphate, is desirable. The HBsAg so produced can also
be combined with other immunogens to prepare combination vaccines. The HBV or combination vaccines can
be administered, for example, by the subcutaneous, intravenous or intramuscular routes. The DNA fragment of
the invention and the HBsAg produced thereby can also be used as a probe for detection of HBV in biological
samples by DNA hybridization techniques and various immunoassays.

EXPRESSION OF A DNA MOLECULE CONTAINING A FUNCTIONAL DNA CODING SEQUENCE FUSED, IN
PHASE, TO A PORTION OF THE PRE S2 REGION OF A HBV PRE S2-S PROTEIN CODING SEQUENCE

This invention also relates to a recombinant DNA molecule comprising a functional DNA coding sequence
fused, in phase, to a portion of the Pre S2 region of a HBV Pre S2-S protein coding sequence. The term "coding
sequence" or "coding region" as used herein also encompasses any functional derivative thereof. By the term
"functional derivative" is meant a coding sequence involving amino acid alterations which, where desired,
do not interfere with particle formation and/or which retain immunogenicity. Such functional derivatives can be
prepared by conventional site specific mutagenesis. Site specific mutagenesis techniques are taught by
Botstein et al., Science, 229, 1193-1201 (1985). Such fusion may occur at the N-terminus of the Pre S2 region
or at some point within the Pre S2 region. There can be four types of fusions, i.e., at the 5'-terminus of the
entire Pre S2 region, at some point within the Pre S2 region so that Pre S2 DNA is flanking both sides of the
functional DNA coding sequence, at the 5'-terminus of a truncated portion of the Pre S2 region, or at the
5'-terminus of the S-protein coding region so that the fusion contains no Pre S2 DNA. Various Pre S2-S protein
coding sequences are known and can be prepared or isolated by known techniques. [See references cited
herein, supra]. In one embodiment of this invention, the ARG3 regulatory region together with 18 amino acids
of the yeast OCT protein were fused, in phase, to a truncated Pre S2-S protein coding sequence such that the
fused sequence contained 42 codons of the Pre S2-S protein coding sequence together with the complete S
protein coding sequence. Preferred functional DNA coding sequences include the circumsporozoite protein
coding sequence of Plasmodium, or any immunogenic derivative thereof, the envelope gene coding sequence
of HIV or any immunogenic derivative thereof, especially the Ci peptide, peptide 121 or Dreesman peptide
coding sequence or any immunogenic derivative thereof, and the Pre S2, Pre S1 or the entire Pre S1-Pre S2-S
protein coding sequence or any immunogenic derivative thereof. For example, a preferred functional DNA
coding sequence is the first 13 N-terminal codons of the Pre S2 coding sequence which are then ligated to the
C-terminal 42 codons of a truncated Pre S2 region of a Pre S2-S protein coding sequence. By the term
"immunogenic derivative" is meant a sequence which is a derivative or modification [...]
sequence which retains or enhances the protective immunogenic activity of [...]. Such
immunogenic derivatives can be prepared by conventional techniques. The term "Pre S2, Pre S1 or the entire
Pre S1-Pre S2-S protein coding sequence" as used herein includes any immunogenic derivative thereof.

By the term "the circumsporozoite protein of Plasmodium" is meant the immunodominant surface antigen
on the sporozoite of Plasmodium or any immunogenic derivative thereof. Useful circumsporozoite proteins
(CS) include, but are not limited to, those derived from Plasmodium falciparum, P. vivax, P. ovale, and P.
malariae. P. falciparum, P. vivax, P. ovale and P. malariae infect humans.
The CS protein coding sequence of Plasmodium falciparum is disclosed by Dame et al., Science, 225,
593-599 (1984). The CS protein coding sequence of P. vivax is disclosed by Arnot et al., Science, 230, 815-818
(1985). The CS protein coding sequence of P. knowlesi is disclosed by Sharma et al., Science, 229, 779-782
(1985). Enea et al., Proc. Natl. Acad. Sci. USA, 81, 7520-7524 (1984), disclose the CS protein coding sequence
of P. cynomolgi. The CS protein coding sequences of other species of Plasmodium are known (see references
cited above) and/or can be derived by conventional techniques.

As used herein, the terms "CSP" or "CS protein" will include both the natural, full length circumsporozoite
protein of Plasmodium as well as any immunogenic derivative thereof. By the term "immunogenic derivative" of
the circumsporozoite protein of Plasmodium is meant (a) any derivative of the circumsporozoite protein of
Plasmodium which retains protective immunogenic activity or (b) any protein derived from a modified coding
sequence of the circumsporozoite protein of Plasmodium which retains protective immunogenic activity. Such
immunogenic derivatives can be prepared by conventional techniques. Some immunogenic synthetic
derivatives of the CS protein coding sequence of P. falciparum are disclosed in Ballou et al., Science, 228,
996-999 (1985). See also Mazier et al., Science, 231, 156-159 (1986).

The CS coding sequence for various species of Plasmodium is known and can be prepared or isolated by
known techniques. [See, e.g., Colman et al., WO84-02922-A (P. knowlesi CS protein) and Dame et al., Science,
225, 593 (1984) (P. falciparum CS protein)]. A DNA molecule containing the CS coding sequence, or any other
functional DNA coding sequence, fused in phase to a portion of the Pre S2 region of a HBV Pre S2-S protein
coding sequence can be constructed by conventional techniques, such as by ligating the CS coding
sequence, or any other functional DNA coding sequence, in phase to any codon within the 55 amino acid Pre
S2 coding region of the Pre S2-S protein coding sequence.
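
What "in phase" means for such a ligation can be illustrated in silico. The sketch below is not a cloning protocol; it assumes the standard genetic code, and the helper names (`translate`, `fuse_in_phase`) and the four-codon insert are hypothetical, chosen only for illustration. The host sequence is the first six codons of the Pre S2 region as listed in this application. The point shown is that an insert made of whole codons, placed at a codon boundary, leaves the downstream reading frame unchanged.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate an unspaced DNA string into one-letter residues."""
    if len(dna) % 3:
        raise ValueError("sequence length is not a multiple of three")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


def fuse_in_phase(host, insert, codon_index):
    """Insert `insert` immediately before host codon `codon_index` (0-based).

    Both sequences must consist of whole codons so that the host reading
    frame downstream of the insertion point is preserved.
    """
    if len(host) % 3 or len(insert) % 3:
        raise ValueError("both sequences must contain whole codons")
    if not 0 <= codon_index <= len(host) // 3:
        raise ValueError("codon_index lies outside the host sequence")
    cut = 3 * codon_index
    return host[:cut] + insert + host[cut:]


# First six Pre S2 codons (Met-Gln-Trp-Asn-Ser-Thr) from the listing in the text.
PRE_S2_START = "ATGCAGTGGAATTCCACT"
# Hypothetical four-codon insert, used purely for illustration.
INSERT = "AATGCAAATCCA"

fused = fuse_in_phase(PRE_S2_START, INSERT, 3)
print(translate(PRE_S2_START))
print(translate(fused))
```

An insert whose length is not a multiple of three is rejected by `fuse_in_phase`, which mirrors the requirement that the fusion not shift the frame of the Pre S2-S coding sequence.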

Preferably, enough of the Pre S2 coding sequence remains after ligation of the CS coding sequence, or any
other functional DNA coding sequence, to ensure optimal presentation of the CS protein, or any other
functional DNA coding sequence protein, on the HBsAg particle surface, i.e., enough Pre S2 region should
remain so that the Pre S2 region encoded polypeptide acts as a bridge between the functional DNA
coding sequence product and the HBsAg particle surface. It has been reported that the 26 amino-terminal
amino acids of the Pre S2 coding sequence represent a dominant antibody binding site on the Pre S2 region.
[See, e.g., Milich et al., Science, 228, 1195-1199 (1985)]. Thus, if it is desired to prepare a hybrid particle which
contains and/or presents both Pre S2 epitopes and the CS, or any other functional coding sequence, epitopes,
it is preferable to insert the CS coding sequence, or any other functional DNA coding sequence, at a point
within the Pre S2 coding region which enables preparation of a hybrid particle which contains and/or presents
both Pre S2 epitopes and CS, or any other functional DNA coding sequence, epitopes.

The recombinant vector of this invention comprises the recombinant DNA molecule of the subject invention
(i.e., a functional DNA coding sequence fused, in phase, to a portion of the Pre S2 region of a HBV Pre S2-S
protein coding sequence) operatively linked to a regulatory region. Such vector may additionally also comprise
a replicon functional in yeast. By the term "replicon" is meant that minimum region of DNA which functions to
maintain stably, and extrachromosomally, such recombinant vector in a host yeast organism transformed with
such vector. Such replicons are well known in the art. By the term "regulatory region" is meant any DNA region
which regulates transcription and translation of the coding sequence desired to be expressed. Such
regulatory regions are well known in the art. Such a vector can be prepared by conventional techniques such
as by inserting the DNA molecule of this invention in phase into a vector already containing a replicon and
regulatory region in such a way that the DNA molecule is operatively linked to such regulatory region.
Preferably the DNA molecule is ligated to a vector capable of expression in the yeast genus Saccharomyces.

This invention also relates to a yeast host cell transformed with such recombinant vector and to a method of
preparing such transformed host which comprises transforming a yeast host cell with the recombinant vector
of this invention. Such transformation is carried out by conventional techniques such as those already
described herein. Preferred yeast cells include those belonging to the genus Saccharomyces, particularly
those belonging to the species Saccharomyces cerevisiae and Saccharomyces carlsbergensis.

This invention also relates to a method of preparing a hybrid particle containing HBsAg protein which
contains and/or presents the functional DNA coding sequence product wherein such method comprises
culturing the transformed yeast host cells of this invention in appropriate culture media, and isolating the
particle from a cell lysate or extract of such host's culture. By the term "presented" or "contains and/or
presents" is meant that the functional DNA coding sequence protein is contained within the hybrid particle
containing HBsAg protein in such a way as to enable an immune response to the functional DNA coding
sequence protein to be produced. When desirable, although not essential, the presentation of the functional
DNA coding sequence protein does not interfere with the immunogenic activity of the HBsAg epitopes, thus
yielding a bivalent vaccine. Most preferably the functional DNA coding sequence protein is presented on the
HBsAg particle in such a way that the maximum number of either the functional DNA coding sequence
protein's epitopes or both the functional DNA coding sequence protein and HBsAg epitopes are properly
exposed. By "appropriate culture media" is meant that media which will enable the transformed host to survive
and which will also enable such host to express the hybrid functional DNA coding sequence-Pre S2-S protein
coding sequence protein product of the vector it is transformed with in recoverable quantity. It will be
appreciated by one of skill in the art that the appropriate culture media to use will depend upon the host cell
employed. The isolation of the hybrid particle of this invention from a cell lysate or extract of such host's
culture is carried out by conventional protein isolation techniques.

This invention also relates to a vaccine containing the hybrid particles of this invention. Such vaccine will
contain an immunoprotective quantity of the hybrid particles of this invention and is prepared by conventional
techniques. By "immunoprotective" is meant that enough of the hybrid particles of this invention are
administered to elicit sufficient protective antibody response against the agent desired to be protected against
without serious side effects. Preferably, the vaccine of this invention comprises the micelle of this invention,
i.e., a micelle comprising the particle of this invention and polysorbate. The preferred amount of polysorbate
comprised by such micelle is 5-60 µg polysorbate per 100 µg protein. The micelle may be prepared by the
method disclosed in European Patent Application Publication No. 0 199 698.
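
The preferred polysorbate range scales linearly with the protein content of a preparation. A minimal sketch of that scaling follows; the function name and the 20 µg example are illustrative assumptions, and only the 5-60 µg per 100 µg figures come from the text.

```python
def polysorbate_range_ug(protein_ug, low_per_100=5.0, high_per_100=60.0):
    """Return the (minimum, maximum) preferred polysorbate mass in micrograms.

    The defaults encode the preferred range stated in the text:
    5-60 ug polysorbate per 100 ug protein.
    """
    if protein_ug < 0:
        raise ValueError("protein mass must be non-negative")
    return (protein_ug * low_per_100 / 100.0, protein_ug * high_per_100 / 100.0)


# Hypothetical preparation containing 20 ug of particle protein.
low, high = polysorbate_range_ug(20)
print(low, high)
```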

In the vaccine of the invention, an aqueous solution of the hybrid particles of the invention, preferably
buffered at physiological pH, can be used directly. Alternatively, the particle can be admixed or adsorbed with
any of the various known adjuvants. Such adjuvants include, among others, aluminum hydroxide.
Vaccine preparation is generally described in New Trends and Developments in Vaccines, edited by Voller et
al., University Park Press, Baltimore, Maryland, U.S.A., 1978.

The amount of the hybrid particle of the present invention present in each vaccine dose is selected as an
amount which induces an immunoprotective response without significant adverse side effects in typical
vaccinees. Such amount will vary depending upon which specific immunogen is employed and whether or not
the vaccine is adjuvanted. Generally, it is expected that each dose will comprise 1-1000 µg of protein,
preferably 1-200 µg. An optimal amount for a particular vaccine can be ascertained by standard studies
involving observation of antibody titres and other responses in subjects. Following an initial vaccination,
subjects will preferably receive a boost in about 4 weeks, followed by repeated boosts every six months for as
long as a risk of infection exists.

The immune response to the hybrid particle of this invention may be enhanced by the use of adjuvant, such
as, but not limited to, aluminum hydroxide (alum).

To exemplify the hybrid particle of this invention, a 64 codon DNA sequence coding for the repetitive epitope
of the circumsporozoite protein (CS) of Plasmodium falciparum and an eight codon CS coding sequence [see
Dame et al., Science, 225, 593-599 (1984)] were each inserted in phase to a portion of the Pre S2 coding
sequence in a yeast expression vector comprising a replicon, the Pre S2-S protein coding sequence and an
appropriate regulatory region. Both expression vectors, i.e., one containing the 64 codon CS sequence and
one containing the 8 codon CS sequence, were each introduced into yeast host cells, and extracts of such
transformed cells were analyzed for the presence of HBsAg epitopes and CS epitopes on the same molecule.

The results obtained from immunoblot analyses using a pool of five monoclonal antibodies specific for CS
and a monoclonal antibody raised against purified yeast-derived HBsAg showed that a 37 kilodalton (Kd)
hybrid protein was obtained from lysates of yeast transformed with the vector containing the 64 codon CS
coding sequence fused in phase with the Pre S2-S protein coding sequence, and a 30 Kd hybrid protein
was obtained from lysates of yeast transformed with the vector containing the 8 codon CS coding sequence
fused in phase with the Pre S2-S protein coding sequence. Assembly of these proteins into HBsAg
particle-like structures was shown by CsCl and sucrose gradient analysis, high pressure liquid
chromatography (HPLC) and electron microscopy. Presentation of the CS epitopes on the exterior of such
particles was shown by the accessibility of the CS epitope to CS specific antibodies in an ELISA assay.
Presentation of HBsAg epitopes on the exterior of such particles was shown by radioimmune assay (AUSRIA
II, Abbott) or ELISA assay (Enzygnost-HBsAg, Behringwerke).

NOVEL PRE S2 AND PRE S2-S PROTEIN CODING SEQUENCES

The novel HBV Pre S2 and Pre S2-S protein coding sequences of this invention are from a HBV adw
subtype which was isolated from plasmid pRIT10616. The term "coding sequence" or "coding region" as used
herein also encompasses any functional derivative thereof. By the term "functional derivative" is meant a
coding sequence with amino acid alterations which, where appropriate, do not interfere with particle formation
and/or which retain immunogenicity where such retention is desirable. Such functional derivatives can be
prepared by conventional site specific mutagenesis such as by the method of Botstein et al., Science, 229,
1193-1201 (1985). Such Pre S2 and Pre S2-S protein coding sequences can be recovered by conventional
recombinant DNA procedures from plasmid pRIT10616 which was deposited (within E. coli K12 strain C600)
[...] the American Type Culture Collection, [...]
1982, under accession number ATCC 38131. The Pre S2 region of such novel protein coding sequences codes
for the following amino acid sequence:

-55                 -50
Met-Gln-Trp-Asn-Ser-Thr-Ala-Phe-His-Gln-Ala-Leu-Gln-Asp-

    -40                                     -30
Pro-Arg-Val-Arg-Gly-Leu-Tyr-Phe-Pro-Ala-Gly-Gly-Ser-Ser-

                            -20
Ser-Gly-Thr-Val-Asn-Pro-Ala-Pro-Asn-Ile-Ala-Ser-His-Ile-

            -10
Ser-Ser-Ser-Ser-Ala-Arg-Thr-Gly-Asp-Pro-Val-Thr-Asn.

Such region may also be obtained by conventional synthetic oligonucleotide synthesis. A preferred
functional derivative of the Pre S2-S coding sequence of this invention is one wherein the amino
acid at -45 is modified via site specific mutagenesis to a threonine (ACT), which results in a
two times greater level of expression by Saccharomyces cerevisiae transformed therewith as compared to
the unmodified coding sequence of this invention.
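
As an illustration of the position numbering only, the following sketch locates the codon at position -45 of the Pre S2 region, using the Pre S2 codons as listed later in this specification, and applies the change to ACT. The names `PRE_S2_CODONS`, `codon_index` and `substitute` are hypothetical, and only the two codons involved in the change are translated. It is a sketch of the bookkeeping, not of the mutagenesis procedure itself.

```python
# Pre S2 coding region (positions -55 to -1), codons as listed in this specification.
PRE_S2_CODONS = (
    "ATG CAG TGG AAT TCC ACT GCC TTC CAC CAA "
    "GCT CTG CAG GAT CCC AGA GTC AGG GGT CTG "
    "TAT TTT CCT GCT GGT GGC TCC AGT TCA GGA "
    "ACA GTA AAC CCT GCT CCG AAT ATT GCC TCT "
    "CAC ATA TCG TCA AGC TCC GCG AGG ACT GGG "
    "GAC CCT GTG ACG AAC"
).split()

FIRST_POSITION = -55  # the initiating Met codon of Pre S2 is numbered -55

# Only the two codons involved in the described change are translated here.
AMINO_ACID = {"GCT": "Ala", "ACT": "Thr"}


def codon_index(position: int) -> int:
    """Convert a Pre S2 position (-55 .. -1) to a 0-based index into the codon list."""
    if not FIRST_POSITION <= position <= -1:
        raise ValueError(f"position {position} is outside the Pre S2 region")
    return position - FIRST_POSITION


def substitute(codons: list[str], position: int, new_codon: str) -> list[str]:
    """Return a copy of the codon list with the codon at `position` replaced."""
    mutated = list(codons)
    mutated[codon_index(position)] = new_codon
    return mutated


original = PRE_S2_CODONS[codon_index(-45)]
mutant = substitute(PRE_S2_CODONS, -45, "ACT")[codon_index(-45)]
print(f"-45: {original} ({AMINO_ACID[original]}) -> {mutant} ({AMINO_ACID[mutant]})")
```

Under this numbering the residue at -45 is the alanine that follows His-Gln in the first line of the sequence above, so the described derivative corresponds to a single-base change in that codon.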

A recombinant vector of this invention comprises the Pre S2 or Pre S2-S protein coding sequence of the
subject invention operatively linked to a regulatory region. Such vector may also additionally contain a replicon
depending on preference and/or the host to be employed. By the term "replicon" is meant that minimum
region of DNA which functions to maintain stably, and extrachromosomally, such recombinant vector in a host
eukaryotic or prokaryotic organism transformed with such vector. Such replicons are well-known in the art. By
the term "regulatory region" is meant any DNA sequence which contains a promoter region and other
sequences necessary for transcription of a coding sequence. Such regulatory regions are well-known in the
art. Such a vector can be prepared by conventional techniques such as by inserting the Pre S2 or Pre S2-S
protein coding sequence of this invention in phase into a vector already containing a replicon and regulatory
region in such a way that the DNA molecule is operatively linked to such regulatory region. Preferably the DNA
molecule is ligated to a vector capable of expression in yeast such as Saccharomyces.

This invention also relates to a host cell transformed with such recombinant vector and to a method of
preparing such transformed host. Such transformation is carried out by conventional techniques. Host cells
can be prokaryotic or eukaryotic. Preferred host cells are eukaryotic, especially yeast cells, and include those
belonging to the genus Saccharomyces, particularly those belonging to the species Saccharomyces
cerevisiae and Saccharomyces carlsbergensis.

This invention also relates to a method of preparing a protein encoded by the Pre S2 or Pre S2-S protein
coding sequence of this invention wherein such method comprises culturing the transformed host cells of this
invention in appropriate culture media, and isolating the protein from the host cell culture fluid or from a cell
lysate or extract of such host's culture depending upon the host cell's ability to secrete such protein. By
"appropriate culture media" is meant that media which will enable the transformed host of this invention to
survive and which will also enable such host to express the Pre S2 or Pre S2-S protein coding sequence of this
invention in recoverable quantity. It will be appreciated by one of skill in the art that the appropriate culture
media to use will depend upon the host cell employed. The isolation of the Pre S2-S protein of this invention
from a culture lysate or the culture fluid of such host is carried out by conventional protein isolation
techniques. The protein prepared by the method of this invention may be glycosylated depending upon the
host cell's ability to do so. If a deglycosylated protein is desired, the protein of this invention should be
prepared by utilizing a host cell incapable of effecting such glycosylation or by employing conventional site
specific mutagenesis on the protein's coding sequence prior to carrying out the process of this invention,
such as by the method of Botstein et al., Science, 229, 1193-1201 (1985).

This invention also relates to a vaccine containing an immunoprotective amount of the Pre S2 or Pre S2-S
protein of this invention. Such vaccine may be prepared by conventional techniques.

This invention also relates to a method of preparing a hybrid particle containing HBsAg protein which
comprises protein encoded by the Pre S2-S protein coding sequence of this invention wherein such method
comprises culturing the transformed eukaryotic host cells of this invention in appropriate culture media, and
isolating the particle from the host cell culture fluid or from a cell lysate or extract of such host's culture
depending upon the transformed host cell's ability to secrete such particles. By "appropriate culture media" is
meant that media which will enable the transformed host of this invention to survive and which will also enable
such host to express the Pre S2-S protein coding sequence of this invention in particle form and in recoverable
quantity. It will be appreciated by one of skill in the art that the appropriate culture media to use will depend
upon the host cell employed. The isolation of the hybrid particle of this invention from a culture lysate or the
culture fluid of such host is carried out by conventional isolation techniques.

This invention also relates to a vaccine containing the hybrid particles of this invention. Such vaccine will
contain an immunoprotective quantity of the hybrid particles of this invention and is prepared by conventional
techniques. Preferably, the vaccine of this invention comprises the micelle of this invention, i.e., a micelle
comprising the particle of this invention and polysorbate. The preferred amount of polysorbate comprised by
such micelle is 5-50 µg polysorbate per 100 µg protein. The micelle may be prepared by the method disclosed
in European Patent Application Publication No. 0 199 698.
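
The stated ratio can be scaled to any protein quantity by simple proportion. The following sketch, in which the function name and the 20 µg protein quantity are hypothetical and chosen only for illustration, computes the corresponding preferred polysorbate range.

```python
def polysorbate_range_ug(protein_ug: float) -> tuple[float, float]:
    """Scale the preferred 5-50 µg polysorbate per 100 µg protein to a given protein mass.

    Returns (low, high) in µg. Pure proportionality is assumed.
    """
    if protein_ug < 0:
        raise ValueError("protein mass must be non-negative")
    return (protein_ug * 5 / 100, protein_ug * 50 / 100)


# Hypothetical example: a preparation containing 20 µg of protein.
low, high = polysorbate_range_ug(20.0)
print(f"{low:g}-{high:g} µg polysorbate for 20 µg protein")
```

The calculation assumes the preferred range scales linearly with protein mass, which the text implies but does not state for quantities other than 100 µg.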

In the vaccine of the invention, an aqueous solution of the Pre S2-S protein or hybrid particle of the
invention, preferably buffered at physiological pH, can be used directly. Alternatively, the protein or particle
can be admixed or adsorbed with any of the various known adjuvants. Such adjuvants include, among others,
aluminum hydroxide.

Vaccine preparation is generally described in New Trends and Developments in Vaccines, edited by Voller et
al., University Park Press, Baltimore, Maryland, U.S.A., 1978. Encapsulation within liposomes is described, for
example, by Fullerton, U.S. Patent 4,235,877. Conjugation of proteins to macromolecules is disclosed, for
example, by Likhite, U.S. Patent 4,372,945 and by Armor et al., U.S. Patent 4,474,757.

The amount of the Pre S2-S protein or hybrid particle of the present invention contained in each vaccine
dose is selected as an amount which induces an immunoprotective response without significant, adverse side
effects in typical vaccines. Such amount will vary depending upon which specific immunogen is employed and
whether or not the vaccine is adjuvanted. Generally, it is expected that each dose will comprise 1-1000 µg of
protein, preferably 1-200 µg. An optimal amount for a particular vaccine can be ascertained by standard
studies involving observation of antibody titres and other responses in subjects. Following an initial
vaccination, subjects will preferably receive a boost in about 4 weeks, followed by repeated boosts every six
months for as long as a risk of infection exists.

The immune response to the Pre S2-S protein or hybrid particle of this invention is enhanced by the use of
adjuvant such as, but not limited to, alum.

NOVEL PRE S1 PROTEIN CODING SEQUENCE

The novel HBV Pre S1 protein coding sequence of this invention is from an HBV adw subtype which was
isolated from plasmid pRIT10616. The term "coding sequence" or "coding region" as used herein also
encompasses any functional derivative thereof. By the term "functional derivative" is meant a coding sequence
involving amino acid alterations which, when desired, do not interfere with particle formation and/or which retain
immunogenicity. Such functional derivatives can be prepared by conventional site specific mutagenesis such
as by the method of Botstein et al., Science, 229, 1193-1201 (1985). Such Pre S1 protein coding sequence can
be recovered by conventional recombinant DNA procedures from plasmid pRIT10616 which was deposited
(within E. coli K12 strain C600) in accordance with the Budapest Treaty at the American Type Culture
Collection, Rockville, Maryland, on June 2, 1982 under accession number ATCC 38131. The Pre S1 region
codes for the following amino acid sequence:

ATG GGG ACG AAT CTT TCT GTT CCC AAC CCT CTG
Met Gly Thr Asn Leu Ser Val Pro Asn Pro Leu

GGA TTC TTT CCC GAT CAT CAG TTG GAC CCT
Gly Phe Phe Pro Asp His Gln Leu Asp Pro

GCA TTC GGA GCC AAC TCA AAC AAT CCA GAT
Ala Phe Gly Ala Asn Ser Asn Asn Pro Asp

TGG GAC TTC AAC CCC ATC AAG GAC CAC TGG
Trp Asp Phe Asn Pro Ile Lys Asp His Trp

CCA GCA GCC AAC CAG GTA GGA GTG GGA GCA
Pro Ala Ala Asn Gln Val Gly Val Gly Ala

TTC GGG CCA GGG CTC ACC CCT CCA CAC GGC
Phe Gly Pro Gly Leu Thr Pro Pro His Gly

GGT ATT TTG GGG TGG AGC CCT CAG GCT CAG
Gly Ile Leu Gly Trp Ser Pro Gln Ala Gln

GGC ATA TTG ACC ACA GTG TCA ACA ATT CCT
Gly Ile Leu Thr Thr Val Ser Thr Ile Pro

CCT CCT GCC TCC ACC AAT CGG CAG TCA GGA
Pro Pro Ala Ser Thr Asn Arg Gln Ser Gly

AGG CAG CCT ACT CCC ATC TCT CCA CCT CTA
Arg Gln Pro Thr Pro Ile Ser Pro Pro Leu

AGA GAC AGT CAT CCT CAG GCC
Arg Asp Ser His Pro Gln Ala


Such region may also be obtained by conventional synthetic oligonucleotide synthesis.

A recombinant vector of this invention comprises the Pre S1 protein coding sequence of the subject
invention operatively linked to a regulatory region and may additionally comprise a replicon depending on
preference and/or the host to be employed. By the term "replicon" is meant that minimum region of DNA
which functions to maintain stably, and extrachromosomally, such recombinant vector in a host eukaryotic or
prokaryotic organism transformed with such vector. Such replicons are well-known in the art. By the term
"regulatory region" is meant a DNA sequence which contains a promoter region and other sequences
necessary for transcription of a coding sequence. Such regulatory regions are well-known in the art. Such a
vector can be prepared by conventional techniques such as by inserting the Pre S1 protein coding sequence
of this invention in phase into a vector already containing a replicon and regulatory region in such a way that
the DNA molecule is operatively linked to such regulatory region. Preferably the DNA molecule is ligated to a
vector capable of expression in yeast such as Saccharomyces.

This invention also relates to a host cell transformed with such recombinant vector and to a method of
preparing such transformed host. Such transformation is carried out by conventional techniques. Host cells
can be prokaryotic or eukaryotic. Preferred host cells are eukaryotic, especially yeast cells, and include those
belonging to the genus Saccharomyces, particularly those belonging to the species Saccharomyces
cerevisiae and Saccharomyces carlsbergensis.

This invention also relates to a method of preparing protein encoded by the Pre S1 protein coding sequence
of this invention wherein such method comprises culturing the transformed host cells of this invention in
appropriate culture media, and isolating the protein from the host cell culture fluid or from a cell lysate or
extract of such host's culture depending upon the transformed host cell's ability to secrete such protein. By
"appropriate culture media" is meant that media which will enable the transformed host of this invention to
survive and which will also enable such host to express the Pre S1 protein coding sequence of this invention in
recoverable quantity. It will be appreciated by one of skill in the art that the appropriate culture media to use will
depend upon the host cell employed. The isolation of the Pre S1 protein of this invention from a culture lysate
or the culture fluid of such host is carried out by conventional isolation techniques. The protein prepared by the
method of this invention may be glycosylated depending upon the host cell's ability to do so. If a
deglycosylated protein is desired, the protein of this invention should be prepared by utilizing a host cell
incapable of effecting such glycosylation or by employing conventional site specific mutagenesis on the
protein's coding sequence prior to carrying out the process of this invention. Site specific mutagenesis is
carried out such as by the method of Botstein et al., Science, 229, 1193-1201 (1985).

This invention also relates to a vaccine containing the Pre S1 protein of this invention. Such vaccine will
contain an immunoprotective quantity of the Pre S1 protein of this invention and is prepared by conventional
techniques.

In the vaccine of the invention, an aqueous solution of the Pre S1 protein of the invention, preferably
buffered at physiological pH, can be used directly. Alternatively, the Pre S1 protein can be admixed or
adsorbed with any of the various known adjuvants. Such adjuvants include, among others, aluminum
hydroxide.

Vaccine preparation is generally described in New Trends and Developments in Vaccines, edited by Voller et
al., University Park Press, Baltimore, Maryland, U.S.A., 1978. Encapsulation within liposomes is described, for
example, by Fullerton, U.S. Patent 4,235,877. Conjugation of proteins to macromolecules is disclosed, for
example, by Likhite, U.S. Patent 4,372,945 and by Armor et al., U.S. Patent 4,474,757.

The amount of the Pre S1 protein of the present invention present in each vaccine dose is selected as an
amount which induces an immunoprotective response without significant, adverse side effects in typical
vaccines. Such amount will vary depending upon which specific immunogen is employed and whether or not
the vaccine is adjuvanted. Generally, it is expected that each dose will comprise 1-1000 µg of protein,
preferably 1-200 µg. An optimal amount for a particular vaccine can be ascertained by standard studies
involving observation of antibody titres and other responses in subjects. Following an initial vaccination,
subjects will preferably receive a boost in about 4 weeks, followed by repeated boosts every six months for as
long as a risk of infection exists. The immune response to the Pre S1 protein of this invention is enhanced by
the use of adjuvant such as, but not limited to, alum.

NOVEL PRES1 - PRES2-S PROTEIN CODING SEQUENCE 

This invention also relates to a HBV PreS1-PreS2-S protein coding sequence of which the PreS1-PreS2
portion comprises the following amino acid sequence:

PRE-S1 REGION

-163
ATG GGG ACG AAT CTT TCT GTT CCC AAC CCT CTG
Met Gly Thr Asn Leu Ser Val Pro Asn Pro Leu

GGA TTC TTT CCC GAT CAT CAG TTG GAC CCT
Gly Phe Phe Pro Asp His Gln Leu Asp Pro

GCA TTC GGA GCC AAC TCA AAC AAT CCA GAT
Ala Phe Gly Ala Asn Ser Asn Asn Pro Asp

TGG GAC TTC AAC CCC ATC AAG GAC CAC TGG
Trp Asp Phe Asn Pro Ile Lys Asp His Trp

CCA GCA GCC AAC CAG GTA GGA GTG GGA GCA
Pro Ala Ala Asn Gln Val Gly Val Gly Ala

TTC GGG CCA GGG CTC ACC CCT CCA CAC GGC
Phe Gly Pro Gly Leu Thr Pro Pro His Gly

GGT ATT TTG GGG TGG AGC CCT CAG GCT CAG
Gly Ile Leu Gly Trp Ser Pro Gln Ala Gln

GGC ATA TTG ACC ACA GTG TCA ACA ATT CCT
Gly Ile Leu Thr Thr Val Ser Thr Ile Pro

CCT CCT GCC TCC ACC AAT CGG CAG TCA GGA
Pro Pro Ala Ser Thr Asn Arg Gln Ser Gly

AGG CAG CCT ACT CCC ATC TCT CCA CCT CTA
Arg Gln Pro Thr Pro Ile Ser Pro Pro Leu

AGA GAC AGT CAT CCT CAG GCC
Arg Asp Ser His Pro Gln Ala

PRE-S2 REGION

-55
ATG CAG TGG AAT TCC ACT GCC TTC CAC CAA
Met Gln Trp Asn Ser Thr Ala Phe His Gln

GCT CTG CAG GAT CCC AGA GTC AGG GGT CTG
Ala Leu Gln Asp Pro Arg Val Arg Gly Leu

TAT TTT CCT GCT GGT GGC TCC AGT TCA GGA
Tyr Phe Pro Ala Gly Gly Ser Ser Ser Gly

ACA GTA AAC CCT GCT CCG AAT ATT GCC TCT
Thr Val Asn Pro Ala Pro Asn Ile Ala Ser

CAC ATA TCG TCA AGC TCC GCG AGG ACT GGG
His Ile Ser Ser Ser Ser Ala Arg Thr Gly

GAC CCT GTG ACG AAC
Asp Pro Val Thr Asn
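
As a consistency check on the listing above, the following sketch translates the Pre S1 and Pre S2 codons with the standard genetic code and confirms that the combined length matches the -163 label on the first Pre S1 codon. The variable names are hypothetical and the codon strings are transcribed from this specification. It is offered as an illustrative check, not as part of the disclosed method.

```python
from itertools import product

# Standard genetic code, built in TCAG order (one-letter amino acids, * = stop).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Codons transcribed from the PRE-S1 REGION listing.
PRE_S1 = (
    "ATG GGG ACG AAT CTT TCT GTT CCC AAC CCT CTG "
    "GGA TTC TTT CCC GAT CAT CAG TTG GAC CCT "
    "GCA TTC GGA GCC AAC TCA AAC AAT CCA GAT "
    "TGG GAC TTC AAC CCC ATC AAG GAC CAC TGG "
    "CCA GCA GCC AAC CAG GTA GGA GTG GGA GCA "
    "TTC GGG CCA GGG CTC ACC CCT CCA CAC GGC "
    "GGT ATT TTG GGG TGG AGC CCT CAG GCT CAG "
    "GGC ATA TTG ACC ACA GTG TCA ACA ATT CCT "
    "CCT CCT GCC TCC ACC AAT CGG CAG TCA GGA "
    "AGG CAG CCT ACT CCC ATC TCT CCA CCT CTA "
    "AGA GAC AGT CAT CCT CAG GCC"
)

# Codons transcribed from the PRE-S2 REGION listing.
PRE_S2 = (
    "ATG CAG TGG AAT TCC ACT GCC TTC CAC CAA "
    "GCT CTG CAG GAT CCC AGA GTC AGG GGT CTG "
    "TAT TTT CCT GCT GGT GGC TCC AGT TCA GGA "
    "ACA GTA AAC CCT GCT CCG AAT ATT GCC TCT "
    "CAC ATA TCG TCA AGC TCC GCG AGG ACT GGG "
    "GAC CCT GTG ACG AAC"
)


def translate(codons: str) -> str:
    """Translate space-separated codons into a one-letter amino acid string."""
    return "".join(CODON_TABLE[c] for c in codons.split())


pre_s1 = translate(PRE_S1)
pre_s2 = translate(PRE_S2)

# With the last Pre S2 residue numbered -1, the first Pre S1 residue is numbered:
first_position = -(len(pre_s1) + len(pre_s2))
print(len(pre_s1), len(pre_s2), first_position)
```

The Pre S1 region contributes 108 codons and the Pre S2 region 55, so numbering backwards from -1 at the end of Pre S2 places the first Pre S1 codon at -163 and the first Pre S2 codon at -55, in agreement with the labels in the listing.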

The novel HBV Pre S1-Pre S2-S protein coding sequence of this invention is from an HBV adw subtype
which was isolated from plasmid pRIT10616 which, as stated above, is available from the American Type
Culture Collection, Rockville, Maryland, under accession number ATCC 38131. The term "coding sequence" or
"coding region" as used herein also encompasses any functional derivative thereof. By the term "functional
derivative" is meant a coding sequence with amino acid alterations which, where appropriate, do not interfere
with particle formation and/or which retain immunogenicity where such retention is desirable. Such functional
derivatives can be prepared by conventional site specific mutagenesis such as by the method of Botstein et
al., Science, 229, 1193-1201 (1985).

A recombinant vector of this invention comprises the Pre S1-Pre S2-S protein coding sequence of the
subject invention operatively linked to a regulatory region. Such vector may also additionally contain a replicon
depending upon preference and/or the host to be employed. By the term "replicon" is meant that minimum
region of DNA which functions to maintain stably, and extrachromosomally, such recombinant vector in a host
eukaryotic or prokaryotic organism transformed with such vector. Such replicons are well-known in the art. By
the term "regulatory region" is meant any DNA sequence which contains a promoter region and other
sequences necessary for transcription of a coding sequence and which regulates transcription of a structural
gene's coding sequence. Such regulatory regions are well-known in the art. Such a vector can be prepared by
conventional techniques such as by inserting the Pre S1-Pre S2-S protein coding sequence of this invention
in phase into a vector already containing a replicon and regulatory region in such a way that the DNA molecule
is operatively linked to such regulatory region. Preferably the DNA molecule is ligated to a vector capable of
expression in yeast such as Saccharomyces.

This invention also relates to a host cell transformed with such recombinant vector and to a method of
preparing such transformed host. Such transformation is carried out by conventional techniques. Host cells
can be prokaryotic or eukaryotic. Preferred host cells are eukaryotic, especially yeast cells, and include those
belonging to the genus Saccharomyces, particularly those belonging to the species Saccharomyces
cerevisiae and Saccharomyces carlsbergensis.

This invention also relates to a method of preparing a protein encoded by the Pre S1-Pre S2-S protein
coding sequence of this invention wherein such method comprises culturing the transformed host cells of this
invention in appropriate culture media, and isolating the protein from the host cell culture fluid or from a cell
lysate or extract of such host's culture depending upon the host cell's ability to secrete such protein. By
"appropriate culture media" is meant that media which will enable the transformed host of this invention to
survive and which will also enable such host to express the Pre S1-Pre S2-S protein coding sequence
of this invention in recoverable quantity. It will be appreciated by one of skill in the art that the appropriate
culture media to use will depend upon the host cell employed. The isolation of the Pre S1-Pre S2-S protein
of this invention from a culture lysate or the culture fluid of such host is carried out by conventional protein
isolation techniques. The protein prepared by the method of this invention may be glycosylated depending
upon the host cell's ability to do so. If a deglycosylated protein is desired, the protein of this invention should
be prepared by utilizing a host cell incapable of effecting such glycosylation or by employing conventional site
specific mutagenesis on the protein's coding sequence prior to carrying out the process of this invention,
such as by the method of Botstein et al., Science, 229, 1193-1201 (1985). The Pre S1-Pre S2-S coding
sequence of this invention will be expressed as a glycosylated protein by a yeast or mammalian host cell
transformed therewith. The Pre S1-Pre S2-S coding sequence of this invention will be expressed as an
N-myristylated protein by a yeast host cell transformed therewith.

This invention also relates to a vaccine containing an immunoprotective amount of the Pre S1-Pre S2-S
protein of this invention. Such vaccine may be prepared by conventional techniques.

This invention also relates to a method of preparing a hybrid particle containing HBsAg protein which
comprises protein encoded by the Pre S1-Pre S2-S protein coding sequence of this invention wherein such
method comprises culturing the transformed eukaryotic host cells of this invention in appropriate culture
media, and isolating the particle from the host cell culture fluid or from a cell lysate or extract of such host's
culture depending upon the transformed host cell's ability to secrete such protein. By "appropriate culture
media" is meant that media which will enable the transformed host of this invention to survive and which will
also enable such host to express the Pre S1-Pre S2-S protein coding sequence of this invention in particle
form and in recoverable quantity. It will be appreciated by one of skill in the art that the appropriate culture
media to use will depend upon the host cell employed. The isolation of the particle of this invention from a
culture lysate or the culture fluid of such host is carried out by conventional isolation techniques.

This invention also relates to a vaccine containing the hybrid particles of this invention. Such vaccine will
contain an immunoprotective quantity of the hybrid particles of this invention and is prepared by conventional
techniques. Preferably, the vaccine of this invention comprises the micelle of this invention, i.e., a micelle
comprising the particle of this invention and polysorbate. The preferred amount of polysorbate comprised by
such micelle is 5-50 µg polysorbate per 100 µg protein. The micelle may be prepared by the method disclosed
in European Patent Application Publication No. 0 199 698.

In the vaccine of the invention, an aqueous solution of the Pre S1-Pre S2-S protein or the hybrid particle of
the invention, preferably buffered at physiological pH, can be used directly. Alternatively, the protein or particle
can be admixed or adsorbed with any of the various known adjuvants. Such adjuvants include, among others,
aluminum hydroxide.

Vaccine preparation is generally described in New Trends and Developments in Vaccines, edited by Voller et
al., University Park Press, Baltimore, Maryland, U.S.A., 1978. Encapsulation within liposomes is described, for
example, by Fullerton, U.S. Patent 4,235,877. Conjugation of proteins to macromolecules is disclosed, for
example, by Likhite, U.S. Patent 4,372,945 and by Armor et al., U.S. Patent 4,474,757.

The amount of the Pre S1-Pre S2-S protein or hybrid particle of the present invention present in each
vaccine dose is selected as an amount which induces an immunoprotective response without significant,
adverse side effects in typical vaccines. Such amount will vary depending upon which specific immunogen is
employed and whether or not the vaccine is adjuvanted. Generally, it is expected that each dose will comprise
1-1000 µg of protein, preferably 1-200 µg. An optimal amount for a particular vaccine can be ascertained by
standard studies involving observation of antibody titres and other responses in subjects. Following an initial
vaccination, subjects will preferably receive a boost in about 4 weeks, followed by repeated boosts every six
months for as long as a risk of infection exists.

The immune response to the Pre S1-Pre S2-S protein or hybrid particle of this invention is enhanced by the
use of adjuvant such as, but not limited to, alum.

EXPRESSION OF PRES1-PRES2-S PROTEIN IN YEAST

This invention also relates to a recombinant DNA vector comprising an entire PreS1-PreS2-S protein coding
sequence operatively linked to an expression control sequence. The term "coding sequence" or "coding
region" as used herein also encompasses any functional derivative thereof. By the term "functional derivative"
is meant a coding sequence with amino acid alterations which, where appropriate, do not interfere with particle
formation and/or which retain immunogenicity where such retention is desirable. Such functional derivatives
can be prepared by conventional site specific mutagenesis such as by the method of Botstein et al., Science,
229, 1193-1201 (1985). Such vector may additionally comprise a replicon depending upon preference and/or
the host to be employed. Preferably, the PreS1-PreS2-S protein coding sequence employed is the one of this
invention.

By the term "replicon" is meant that minimum region of DNA which functions to maintain stably, and
extrachromosomally, such recombinant vector in a host yeast organism transformed with the vector of this
invention. Such replicons are well-known in the art. By the term "regulatory region" is meant any DNA
sequence which contains a promoter region and other sequences necessary for transcription of a coding
sequence and which regulates transcription of a structural gene's coding sequence. Such regulatory regions are
well-known in the art. Such a vector can be prepared by conventional techniques such as by inserting a Pre S1-
Pre S2-S protein coding sequence in phase into a vector already containing a replicon and regulatory region in
such a way that the DNA molecule is operatively linked to such regulatory region. Preferably the DNA molecule
is ligated to a vector capable of expression in the yeast genus Saccharomyces. Preferably the Pre S1-Pre S2-S
protein coding sequence contained by the vector of this invention is the Pre S1-Pre S2-S protein coding
sequence of this invention described hereinabove. Such vectors are also useful for the insertion therein of
functional DNA coding sequences to enable expression of the resulting Pre S1-functional DNA coding
sequence-Pre S2-S protein coding sequence comprised by such vector. Therefore, this invention also relates
to a recombinant DNA molecule comprising a functional DNA coding sequence fused in phase into the Pre S1
region of the Pre S1-Pre S2-S protein coding sequence, in such a way that the functional DNA sequence either
forms an insertion in the Pre S1 region or replaces part of the Pre S1 region and possibly a portion of the Pre
S2 region, and to a vector comprising such recombinant DNA molecule. See, for example, Example 21A, infra.
By the term "functional DNA coding sequence" is meant a DNA coding sequence which, when fused, in phase,
into the Pre S1 region of the Pre S1-Pre S2-S protein coding sequence, is not involved with, and does not
interfere with, the assembly of the HBsAg particle. Preferred functional DNA coding sequences include, but are
not limited to, the circumsporozoite coding sequence of Plasmodium or any immunogenic derivative thereof,
the envelope gene coding sequence of HIV or any immunogenic derivative thereof, especially the C7 peptide,
peptide 121 or Dreesman peptide coding sequence as hereinafter exemplified, or sequences coding for
peptides of interest from other viruses, especially those for coat proteins, or for surface proteins from other
parasites.
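
The requirement that the functional DNA coding sequence be fused "in phase" can be illustrated with a short sketch. It assumes, for simplicity, that the insertion or replacement begins at a codon boundary of the Pre S1 region, so that the downstream Pre S2-S reading frame is preserved when both the inserted and the removed segments are whole numbers of codons and the insert contains no in-frame stop codon. The function name and the example sequences are hypothetical and are not taken from the Examples.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def fused_in_phase(insert: str, removed_length: int = 0) -> bool:
    """Check that an insert keeps the downstream reading frame.

    Assumes the insert starts at a codon boundary of the host coding sequence.
    `removed_length` is the number of host bases replaced by the insert
    (0 for a pure insertion).
    """
    insert = insert.upper()
    # Both the inserted and the removed segments must be whole codons.
    if len(insert) % 3 != 0 or removed_length % 3 != 0:
        return False
    # An in-frame stop codon would terminate translation before Pre S2-S.
    codons = (insert[i:i + 3] for i in range(0, len(insert), 3))
    return not any(codon in STOP_CODONS for codon in codons)


# Hypothetical inserts, for illustration only.
print(fused_in_phase("AATGCTAATCCT"))       # 12 bases, four whole codons
print(fused_in_phase("AATGCTAATCC"))        # 11 bases, shifts the frame
print(fused_in_phase("AATTAAGCT"))          # contains an in-frame TAA stop
print(fused_in_phase("AATGCTAATCCT", 6))    # replaces two host codons
```

A fuller check would also handle insertions that begin within a codon, where the frame is preserved whenever the net change in length is a multiple of three.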

This invention also relates to a yeast host cell transformed with such recombinant vector and to a method of
preparing such transformed host. Such transformation is carried out by conventional techniques. Preferred
yeast cells include those belonging to the genus Saccharomyces, particularly those belonging to the
species Saccharomyces cerevisiae.

This invention also relates to a method of preparing a protein encoded by a Pre S1-Pre S2-S protein coding
sequence or Pre S1-functional DNA coding sequence-Pre S2-S protein coding sequence wherein such
method comprises culturing the transformed host cells of this invention in appropriate culture media, and
isolating the protein from the host cell culture fluid or from a cell lysate or extract of such host's culture
depending upon the host cell's ability to secrete such protein. By "appropriate culture media" is meant that
media which will enable the transformed host of this invention to survive and which will also enable such host
to express a Pre S1-Pre S2-S protein coding sequence or Pre S1-functional DNA coding sequence-Pre S2-S
protein coding sequence product in recoverable quantity. It will be appreciated by one of skill in the art that the
appropriate culture media to use will depend upon the host cell employed. The isolation of the Pre S1-Pre S2-S
protein or Pre S1-functional DNA coding sequence-Pre S2-S protein from a cell lysate or the culture fluid of
such host culture is carried out by conventional protein isolation techniques. The protein prepared by the
method of this invention may be glycosylated depending upon the host cell's ability to do so. If a
deglycosylated protein is desired, the protein of this invention should be prepared by utilizing a host cell
incapable of effecting such glycosylation or by employing conventional site specific mutagenesis on the
protein's coding sequence prior to carrying out the process of this invention, such as by the method of
Botstein et al., Science, 229, 1193-1201 (1985).

This invention also relates to a vaccine containing an immunoprotective amount of Pre S1-Pre S2-S protein
or Pre S1-functional DNA coding sequence-Pre S2-S protein produced by the method of this invention. Such
vaccine may be prepared by conventional techniques.

This invention also relates to a method of preparing a hybrid particle which comprises protein encoded by a
PreS1-PreS2-S protein coding sequence or Pre S1-functional DNA coding sequence-Pre S2-S protein
coding sequence, wherein such method comprises culturing the transformed yeast host cells of this invention
in appropriate culture media, and isolating the particle from the host cell culture fluid or from a cell lysate or
extract of such host's culture depending upon the transformed host cell's ability to secrete such particles. By
"appropriate culture media" is meant that media which will enable the transformed host to survive and which
will also enable such host to express the Pre S1-Pre S2-S protein coding sequence or Pre S1-functional DNA
coding sequence-Pre S2-S protein coding sequence in particle form and in recoverable quantity. It will be
appreciated by one of skill in the art that the appropriate culture media to use will depend upon the host cell
employed. The isolation of the particles of this invention from a culture lysate or the culture fluid of such host is
carried out by conventional isolation techniques.

This invention also relates to a vaccine containing the particles of this invention. Such vaccine will contain an
immunoprotective quantity of the particles of this invention and is prepared by conventional techniques.
Preferably, the vaccine of this invention comprises the micelle of this invention, i.e., a micelle comprising the
particle of this invention and polysorbate. The preferred amount of polysorbate comprised by such micelle is
5-50 µg polysorbate per 100 µg protein. The micelle may be prepared by the method disclosed in European
Patent Application Publication No. 0 199 698.

In the vaccine of the invention, an aqueous solution of the Pre S1-Pre S2-S protein or Pre S1-functional DNA
coding sequence-Pre S2-S protein or the particle of the invention, preferably buffered at physiological pH,
can be used directly. Alternatively, the protein or particle can be admixed or adsorbed with any of the various
known adjuvants. Such adjuvants include, among others, aluminum hydroxide.

Vaccine preparation is generally described in New Trends and Developments in Vaccines, edited by Voller et
al., University Park Press, Baltimore, Maryland, U.S.A., 1978. Encapsulation within liposomes is described, for
example, by Fullerton, U.S. Patent 4,235,877. Conjugation of proteins to macromolecules is disclosed, for
example, by Likhite, U.S. Patent 4,372,945 and by Armor et al., U.S. Patent 4,474,757.
The amount of the Pre S1-Pre S2-S protein or Pre S1-functional DNA coding sequence - Pre S2-S protein or
particle of the present invention present in each vaccine dose is selected as an amount which induces an
immunoprotective response without significant, adverse side effects in typical vaccinees. Such amount will vary
depending upon which specific immunogen is employed and whether or not the vaccine is adjuvanted.
Generally, it is expected that each dose will comprise 1-1000 µg of protein, preferably 1-200 µg. An optimal
amount for a particular vaccine can be ascertained by standard studies involving observation of antibody titres
and other responses in subjects. Following an initial vaccination, subjects will preferably receive a boost in
about 4 weeks followed by repeated boosts every six months for as long as a risk of infection exists.

The immune response to the Pre S1-Pre S2-S protein or Pre S1-functional DNA coding sequence - Pre S2-S
protein or particle of this invention is enhanced by the use of adjuvant such as, but not limited to, alum.

EXAMPLES

In the following examples of the invention, which are illustrative and not limiting:

- All percentages are by weight and all temperatures are in degrees Celsius (Centigrade).

- The enzymes used in DNA manipulations were obtained from Bethesda Research Laboratories, New England
Biolabs, and/or Boehringer, and were employed according to the suppliers' directions.
- Yeast growth media: selective medium: YNB (yeast nitrogen base without amino acids (Difco Labs) 0.675%
(w/v) containing 2% (w/v) glucose).

- YEPD medium: 1% (w/v) yeast extract, 2% (w/v) peptone and 2% (w/v) glucose.

- DNA recombinant methods are described in "Molecular Cloning", T. Maniatis et al., Cold Spring Harbor Lab.
(1982).

- PMSF: phenyl methyl sulphonyl fluoride

- The following abbreviations may be employed: 

PBS: Phosphate Buffered Saline (per liter): (pH 7.4)
8.0 gram NaCl
0.2 gram KCl
1.15 gram Na2HPO4
0.2 gram KH2PO4
0.1 gram CaCl2
0.1 gram MgCl2 • 6H2O

BSA: Bovine Serum Albumin (A4378, Sigma) 

X-gal: 5-bromo-4-chloro-3-indolyl-β-D-galactopyranoside, commercially available from Sigma Chemical Co., St.
Louis, Missouri.

L-BROTH (per liter):
10 gram Tryptone
5 gram NaCl
5 gram Yeast extract
1 ml 0.1 M MgSO4

Add 5 ml of aseptic solution of thiamine (1 mg/ml) after autoclaving. For solid media, add 15 g agar per liter.
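
The per-liter recipes above can be scaled to any working volume by simple proportion. The following is a minimal sketch only; the dictionary and function names are illustrative and are not part of the original disclosure.

```python
# Grams per liter of PBS, taken from the recipe listed above.
PBS_G_PER_L = {
    "NaCl": 8.0,
    "KCl": 0.2,
    "Na2HPO4": 1.15,
    "KH2PO4": 0.2,
    "CaCl2": 0.1,
    "MgCl2.6H2O": 0.1,
}


def scale_recipe(recipe_g_per_l, volume_ml):
    """Return the grams of each component needed for volume_ml of solution."""
    factor = volume_ml / 1000.0
    return {name: round(grams * factor, 4) for name, grams in recipe_g_per_l.items()}


if __name__ == "__main__":
    # Example: amounts for 250 ml of PBS.
    for name, grams in scale_recipe(PBS_G_PER_L, 250).items():
        print(f"{name}: {grams} g")
```

The same function can be applied to the L-broth recipe by supplying its per-liter amounts in place of the PBS table.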

Example 1. Construction of plasmid pRIT10167

The starting material was plasmid p6y containing a 2.1 kb HindIII fragment of yeast DNA encoding the TDH3
gene cloned on pBR322. pBR322 is described by Bolivar et al. (Gene, 2, 95-113, 1977). The p6y plasmid was
constructed and characterized by Musti et al. (Gene, 25, 133, 1983) and received from Dr. M. Rosenberg of the
National Institute of Health. The HindIII fragment was recloned onto a pBR322 derivative in which the EcoRI site
had been destroyed to give plasmid pRIT10164 (a non essential manoeuvre). The following manipulations were
carried out so as to create a BamHI site located at the G residue of the ATG codon of the TDH3 coding
sequence and to obtain a HindIII-BamHI DNA fragment with TDH3 promoter activity in which the TDH3
sequences are intact and unchanged by the manipulation. DNA of pRIT10164, prepared as described above,
was purified by two cycles of CsCl-ethidium bromide density gradient centrifugation essentially as described
by Kahn et al. (Methods in Enzymology, 68, 268, 1979). 150 µg of pRIT10164 DNA were digested to completion
with 75 units of XbaI endonuclease, extracted with phenol and ether, precipitated with ethanol and the DNA
resuspended in 0.01 M tromethamine-HCl buffer at a concentration of 1 µg per µl. Samples of 20 µg of this XbaI
treated DNA were digested with Bal31 nuclease to resect the 61 base pairs of DNA between the ATG codon
and the XbaI site. Digestions with Bal31 were carried out at 30° for 1 to 3 minutes in buffer containing 600 mM
NaCl, 12 mM CaCl2, 12 mM MgCl2, 1 mM EDTA, 20 mM tromethamine-HCl, pH 8.1 using one unit of Bal31
nuclease per 20 µg DNA in a final reaction volume of 200 µl.

The enzyme reactions were stopped by addition of ethylenebis(oxyethylenenitrilo) tetraacetic acid (EGTA)
to give 20 mM final concentration and the samples were extracted with equal volumes of phenol, ether and
precipitated with ethanol. Each DNA sample was resuspended in 20 µl 10 mM tromethamine-HCl buffer pH 7.5.
The extent of Bal31 digestion was measured by digesting about 2.5 µg of each DNA sample with HpaI
endonuclease and comparing the size of the HpaI-XbaI fragments to the 335 bp HpaI-XbaI fragment from
pRIT10164. A 2 minute digestion with Bal31 was found to remove about 41 to 88 nucleotides from the
XbaI-HpaI fragment of pRIT10164.

This result indicates that a similar number of base pairs had been removed from the other XbaI extremity
towards the ATG codon. 5 µg of the DNA recovered from the 2 minute Bal31 digest was digested with BamHI
endonuclease, extracted with phenol and recovered by ethanol precipitation. This DNA was treated with 5
units of T4 polymerase in the presence of deoxynucleotide triphosphates to fill in and render flush ended the
BamHI and any Bal31 single stranded extremities, extracted with phenol and recovered by ethanol
precipitation. 2.5 µg of this DNA was treated with 5 units of T4 DNA ligase and half of the ligation mixture used
to transform competent cells of E. coli K12 strain MM294 prepared according to the method of Cohen et al.
(Proc. Natl. Acad. Sci. 69, 2110, 1972). One ml of the transformed E. coli population was diluted into 350 ml
L-broth with 200 µg/ml ampicillin and total plasmid DNA was purified from the resulting culture.
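
The reasoning behind the choice of the 2 minute digest can be checked with simple arithmetic. The sketch below assumes, as the text does, that resection is similar at both XbaI extremities, and further assumes (this is not stated in the text) that resection is roughly linear in time. The names are illustrative only.

```python
TARGET_BP = 61             # bp between the ATG codon and the XbaI site (from the text)
REMOVED_RANGE = (41, 88)   # nucleotides removed from one extremity in the 2 minute digest
DIGEST_MINUTES = 2.0


def rate_range(removed_range, minutes):
    """Apparent resection rate (nucleotides per minute) at one extremity.

    Assumes resection is linear in time, which is an approximation.
    """
    low, high = removed_range
    return low / minutes, high / minutes


def target_in_range(target_bp, removed_range):
    """True if the desired resection length lies within the observed range."""
    low, high = removed_range
    return low <= target_bp <= high


if __name__ == "__main__":
    print("rate (nt/min):", rate_range(REMOVED_RANGE, DIGEST_MINUTES))
    print("61 bp target covered:", target_in_range(TARGET_BP, REMOVED_RANGE))
```

On these figures the 61 bp target falls inside the observed 41 to 88 nucleotide range, which is consistent with the 2 minute sample being carried forward.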

80 µg of this plasmid DNA containing a population of plasmid molecules with Bal31 induced deletions of
about 40 to 80 base pairs was digested sequentially with 75 units of HindIII endonuclease and 96 units of
BamHI endonuclease to release DNA fragments from those plasmids in which a BamHI site had been
recreated after the treatments described above, that is, where the Bal31 digestion had terminated at a G
residue. This DNA digest was run on a preparative 1% agarose gel and the desired HindIII-BamHI fragments
corresponding to sizes from 1100 to 1000 bp excised from the gel in two slices, one corresponding to DNA of
about 1070 bp and the other of about 1030 bp. The DNA was recovered from the two agarose gel slices by
cycles of freezing and thawing followed by high speed centrifugation to pellet the agarose. The supernatant
liquid was filtered through a Millipore GV Millex filter and the DNA recovered by two cycles of ethanol
precipitation and resuspended in 20 µl of 0.01 M tromethamine-HCl buffer pH 7.5.
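
The selection principle used above, namely that a BamHI site (GGATCC) is recreated only where the Bal31 digestion terminated at a G residue, can be illustrated as follows. This is a sketch under stated assumptions: the filled-in BamHI end is taken to read GATCC on the top strand, and the test sequence is a hypothetical stand-in, not the actual TDH3 sequence.

```python
BAMHI_SITE = "GGATCC"
FILLED_BAMHI_END = "GATCC"  # top strand of a BamHI end (G^GATCC) after fill-in


def recreates_site(left_end, right_end, site):
    """True if blunt ligation of left_end to right_end forms `site` across the junction."""
    joined = left_end + right_end
    junction = len(left_end)  # index of the first base contributed by right_end
    first_start = max(0, junction - len(site) + 1)
    for start in range(first_start, junction):
        if joined[start:start + len(site)] == site:
            return True
    return False


def productive_endpoints(sequence, right_end=FILLED_BAMHI_END, site=BAMHI_SITE):
    """Lengths of the prefixes of `sequence` whose blunt end recreates `site`."""
    return [
        n
        for n in range(1, len(sequence) + 1)
        if recreates_site(sequence[:n], right_end, site)
    ]


if __name__ == "__main__":
    toy = "CAAAATGGCT"  # hypothetical sequence containing an ATG, for illustration only
    for n in productive_endpoints(toy):
        print(n, toy[:n] + FILLED_BAMHI_END)
```

Only deletion endpoints ending in G are reported, which is why the subsequent HindIII plus BamHI digest releases fragments solely from the desired class of plasmids.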

Analysis by agarose gel electrophoresis and comparison to a HindIII plus XbaI digest of pRIT10164 DNA and
to the fragments from a HindIII plus EcoRI digest of phage λ DNA showed that two distinct populations of
HindIII-BamHI fragments were obtained, one with an estimated size of about 1070 bp and one with an
estimated size of about 1030 bp compared to the 1120 bp parental HindIII-XbaI fragment from pRIT10164.
About 100 ng of the 1030 bp HindIII-BamHI fragment population was ligated with 200 ng of plasmid pUC9 which
had been digested with HindIII and BamHI endonucleases and treated with alkaline phosphatase (See, Shine,
U.S. Patent No. 4,264,731) and the ligation mixture was used to transform competent cells of E. coli strain
JM103 with selection being made for ampicillin resistance. Transformation of E. coli strain JM103 was effected
according to the method of Cohen et al., cited above.

Both the pUC9 vector plasmid and the recipient E. coli strain JM103 have been described by Vieira and
Messing in Gene, 19, 259, 1982 and were obtained from J. Messing (Univ. of Minnesota). The pUC9 vector is
commercially available from Amersham (Buckinghamshire, United Kingdom) and Pharmacia (Uppsala,
Sweden). E. coli strain JM103 is available from J. Messing (Univ. of Minnesota). Another E. coli strain with the
characteristics of E. coli strain JM103, i.e., JM101, can be obtained from the American Type Culture Collection,
Rockville, Maryland, under accession number ATCC 33876, and can also be employed as host. About 400
ampicillin resistant colonies per ml were obtained and of 98 colonies tested on medium containing X-gal, 95
were white indicating the successful insertion of a foreign DNA fragment between the HindIII and BamHI sites
on the vector.

Plasmids from individual transformant colonies were prepared by a small scale procedure (Birnboim and
Doly, Nucl. Acid Res., 7, 1513, 1979) after amplification of the plasmids by addition of spectinomycin (150
µg/ml) to the growing cultures.

The recombinant plasmids were analyzed on 7.5% acrylamide gels after double digestion with AvaII and
BamHI endonucleases and compared to the 450 bp AvaII-XbaI fragment comprising the promoter-N-terminus
coding region of pRIT10164 and to the fragments of a HpaII digest of pBR322 DNA. An AvaII-BamHI fragment
corresponding to deletions of from 20 to 90 base pairs was present in 35 of 36 plasmids examined when
compared to pRIT10164. Three plasmids representing deletions of about 80 base pairs (pRIT10166), 50 base
pairs (pRIT10167-Figure 8) and 45 base pairs (pRIT10165) were chosen for further study. Plasmid DNA was
purified by CsCl-ethidium bromide density centrifugation and 25 µg of each digested with EcoRI. The EcoRI
extremities were labelled with γ-32P-ATP by exchange kination and the nucleotide sequence of the
HindIII-EcoRI fragments from each plasmid determined by the chemical modification methods of Maxam and
Gilbert (Methods in Enzymology, 65, 499, 1980). Labelling of and sequencing from the EcoRI site in the pUC9
vector portion of each recombinant plasmid is a convenient means of determining the sequence around the
adjacent BamHI site marking the end of the deletion into the TDH3 DNA fragment. The sequence analysis
showed that for plasmid pRIT10166, the BamHI site is recreated at a G residue located in the 5' non translated
region 25 base pairs upstream of the ATG codon; in pRIT10165 the BamHI site is located at the second base of
the third codon; and in pRIT10167 the BamHI site is located at the G residue of the ATG codon.

The HindIII-BamHI fragment of TDH3 DNA thus constructed in pRIT10167 contains all the sequences
necessary for TDH3 promoter activity in an unaltered form.

The HindIII-BamHI fragment from pRIT10167, prepared as described in Example 1, was
recloned onto the YEp13 shuttle vector described by Broach et al. (Gene, 8, 121, 1979) between the HindIII site
in the 2 micron DNA portion of the vector and the BamHI site to give the recombinant plasmid pRIT12159. DNA
of pRIT12159 was digested with XbaI and BamHI endonucleases and the resulting fragment [...]
ligated in place of a similar XbaI-BamHI fragment containing the arg3 promoter on plasmid pRIT10774. Plasmid
pRIT10774 is described by Cabezon et al. (Proc. Natl. Acad. Sci. U.S., 81, 6594-6598, 1984) and comprises the
YEp13 replicon from which the natural vector's BamHI and XhoI sites have been deleted by in vitro
manipulation together with a 1470 bp HindIII-BamHI insert comprising the arg 3 promoter and a 1150 bp
BamHI-HindIII fragment comprising the arg 3 transcription termination region. Replacement of the arg 3
promoter insert on pRIT10774 by the TDH3 promoter insert gives plasmid pRIT10172, which therefore contains
a single BamHI site located at the ATG codon of the TDH3 promoter insert and preceding a functional signal
for transcription termination on the 1150 bp BamHI-HindIII arg 3 DNA fragment. Foreign DNA can therefore be
cloned at this site. Furthermore the two DNA inserts are bounded by HindIII sites enabling removal of the
expression unit module (cassette) as a 2200 bp HindIII fragment for insertion on other vectors. Figure 9 is a
restriction endonuclease cleavage map of pRIT10172.

Plasmid DNA of pRIT10167 (Figure 8), prepared as described in Example 1, [...]
containing the HindIII-BamHI TDH3 promoter fragment together with the BamHI-SmaI-EcoRI part of the pUC9
[...] BamHI site by filling in with T4 DNA polymerase and religation. The resulting recombinant plasmid was
[...]
Plasmid DNA of pRIT10158 (Figure 8), prepared as described in Example 4 (b), below, was digested with
EcoRI endonuclease and treated with T4 DNA polymerase to fill in the EcoRI single strand extensions. [...]
preparation was further digested with ClaI endonuclease. A further sample of pRIT10158 DNA was digested
with ClaI and HaeIII endonucleases and a 1150 bp ClaI-HaeIII fragment carrying the transcription termination
region of the arg 3 gene was recovered by preparative agarose gel electrophoresis and electroelution. This
fragment was ligated with the EcoRI, T4 DNA polymerase, ClaI treated pRIT10158 DNA and the ligation
mixture was used to transform competent cells of E. coli strain MM294, prepared as described
above, to ampicillin resistance. From the transformed colonies, a plasmid, identified as pRIT10162, was
isolated in which the ClaI-HaeIII arg 3 transcription termination fragment was inserted on pRIT10158 between
the ClaI and filled in EcoRI sites and in which the EcoRI site has been recreated. Figure 8 is a flow sheet which
illustrates the preparation of pRIT10162. Plasmid DNA of pRIT10162 was digested with EcoRI and PstI
endonucleases, mixed with plasmid DNA of pRIT12176, prepared as described above (also digested with
EcoRI and PstI endonucleases) and the mixture was ligated and used to transform cells of E. coli strain
MM294 to ampicillin resistance by the methods of Cohen et al., cited above. A plasmid, pRIT12208, was
recovered from a colony from this transformation in which the large 4630 bp EcoRI-PstI fragment from
pRIT12176 has been ligated to the small 1930 bp EcoRI-PstI fragment from pRIT10162, and in which the arg 3
transcription termination [...]

[...] other orientation of this fragment with respect to vector sequences, the 2200 bp HindIII fragment from
pRIT12208 was recloned into the HindIII site of a pBR322 derivative plasmid in which the EcoRI and
BamHI sites had separately been deleted by filling in with T4 DNA polymerase and religation. The resulting
plasmid, with the 2200 bp HindIII fragment inserted in the opposite orientation with respect to vector
sequences compared to pRIT12208, is identified as pRIT12290 and deposited as ATCC 67186 on August [...],
1986 at the ATCC in accordance with the Budapest Treaty's regulations. Figure 10 is a restriction endonuclease
cleavage map of pRIT12290. In both the pRIT12208 and pRIT12290 vectors, the TDH3 promoter and arg 3
transcription termination fragments are separated by unique BamHI, SmaI and EcoRI restriction sites useful
for the cloning of DNA fragments carrying coding sequences, and the promoter and transcription termination
regions can be excised together on a 2200 bp HindIII fragment as a module or cassette for transfer to other
vectors.

The BamHI site is especially useful as it is located at the ATG codon of the TDH3 gene and can be employed
for fusion of other genes to the TDH3 promoter with a view to expressing them in yeast as exemplified below.
Such fusions can be recovered from pRIT12208 or pRIT12290 derivatives in the form of cassettes or modules
for insertion on yeast shuttle vectors by digestion with HindIII endonuclease or by the use of other restriction
endonucleases cutting at flanking restriction sites.

Example 4. Construction of plasmids pRIT10677, pRIT10909 and pRIT10158.

a) pRIT10677

A 1372 bp BamHI fragment of cloned HBV DNA was excised from pRIT10616 (Harford et al., Dev. Biol.
Standard, 54, 125-130, 1983) and mixed and ligated with BamHI endonuclease digested pBR327 DNA. This
BamHI fragment contains part of the HBsAg precursor region, the HBsAg coding sequence and 3' non coding
sequences. From the ligation mixture a plasmid, pRIT10677, was selected which has the HBV BamHI fragment
inserted at the BamHI site of the vector in the orientation such that the XbaI site in the HBV DNA is proximal to
the SalI site in the pBR327 vector. Figure 8 is a flow sheet which illustrates the preparation of pRIT10677.

b) pRIT10909 

The 3300 bp HindIII fragment from plasmid pMC200 (available from the American Type Culture Collection,
Rockville, Maryland, U.S.A., without restriction under accession number ATCC 39131), described by Crabeel
M. et al. (EMBO J., 2, 205-212, 1983), was recloned onto a pBR322 derivative plasmid on which the natural
EcoRI site had been deleted by filling in with T4 DNA polymerase and religation. A resulting recombinant
plasmid, pRIT10158 (Figure 8), was selected in which the HindIII arg 3 gene insert has the 3' region of the arg 3
gene proximal to the ClaI site on the modified pBR322 vector. From pRIT10158, a 1150 bp ClaI-HaeIII fragment
was recovered which carries the transcription termination region of the arg 3 gene. This 1150 bp fragment was
ligated with ClaI and HpaI endonucleases digested DNA of plasmid pRIT10677 (prepared as described in Part
a), above) to fuse the arg 3 termination region at the HpaI site downstream of the HBsAg coding region. The
resulting plasmid is pRIT10909. Figure 8 is a flow sheet which illustrates the preparation of pRIT10909. Figure 3
is a restriction endonuclease cleavage map of pMC200.

Example 5. Construction of plasmids pRIT10911 and pRIT10912.

The BamHI site on pRIT10167 (Example 1) and the natural BamHI site located in the precursor region of the
HBsAg gene from a HBV virus of adw serotype cloned on pRIT10616 (Harford et al., Dev. Biol. Standard, 54,
125-130, 1983) are in the same translational reading frame enabling the construction of a fusion of the TDH3
promoter fragment which supplies the ATG codon to 42 codons of the HBsAg precursor sequence followed by
226 codons of the HBsAg coding sequence proper. The source of the HBV DNA fragment for this fusion was a
plasmid pRIT10909 (see, Example 4, part b, above) containing a 935 bp BamHI-HpaI insert encoding part of the
HBsAg precursor region, the full HBsAg coding sequence and 128 bp of 3' non translated DNA to which had
been fused, downstream of the HpaI site, an 1150 bp HaeIII-ClaI DNA fragment from pRIT10158 containing
the arg 3 transcription termination region. pRIT10158 is described in Example 4, part b, above. The 2085 bp
EcoRI-BamHI fragment was excised from pRIT10909 and inserted between the EcoRI and BamHI sites of
pRIT10167, prepared as described in Example 1, so as to create plasmid pRIT10911. The 3136 bp HindIII
fragment from pRIT10911 containing the yeast DNA transcriptional signals and the HBsAg coding sequences
was then inserted on the YEp13 shuttle vector to give plasmid pRIT10912. Figure 8 is a flow sheet illustrating
the preparation of pRIT10911.
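
The fragment sizes quoted in this Example can be cross-checked by arithmetic. The sketch below assumes that the 226 codons of the HBsAg coding sequence do not include the stop codon and ignores the few base pairs of vector sequence lying between adjacent restriction sites; both are assumptions made here for illustration.

```python
PRECURSOR_CODONS = 42        # HBsAg precursor codons fused after the ATG (from the text)
S_CODONS = 226               # HBsAg coding sequence proper (from the text)
STOP_CODON_BP = 3            # assumption: the 226 codons exclude the stop codon
NONTRANSLATED_3P_BP = 128    # 3' non translated HBV DNA (from the text)
ARG3_TERMINATOR_BP = 1150    # arg 3 transcription termination fragment (from the text)
HINDIII_FRAGMENT_BP = 3136   # HindIII fragment of pRIT10911 (from the text)


def hbv_insert_bp():
    """Length of the BamHI-HpaI HBV insert implied by the codon counts."""
    coding_bp = (PRECURSOR_CODONS + S_CODONS) * 3
    return coding_bp + STOP_CODON_BP + NONTRANSLATED_3P_BP


def ecori_bamhi_fragment_bp():
    """HBV insert plus the fused arg 3 termination fragment."""
    return hbv_insert_bp() + ARG3_TERMINATOR_BP


if __name__ == "__main__":
    print("BamHI-HpaI insert:", hbv_insert_bp(), "bp")
    print("EcoRI-BamHI fragment:", ecori_bamhi_fragment_bp(), "bp")
    print("implied promoter portion:",
          HINDIII_FRAGMENT_BP - ecori_bamhi_fragment_bp(), "bp")
```

Under these assumptions the totals agree with the 935 bp and 2085 bp figures given above, and the remainder of the 3136 bp HindIII fragment is close to the size estimated for the TDH3 promoter fragment in Example 1.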


Example 6. Construction of plasmids pRIT10903 and pRIT10914

The aim of the constructions described below was to remove excess nucleotides from the 5' end of a DNA
fragment encoding at least the N-terminal portion of the mature HBsAg coding sequence in such a fashion as
to create a 6 base pair restriction site at the first base of the second codon. The resulting fragment can then be
fused to promoter fragments with a 6 base pair restriction site at the initiation codon using Mung Bean or S1
nucleases to trim away the single strand extensions and create blunt ends for ligation as shown by Rosenberg
et al., Methods in Enzymology, 101C, 123 (1983).

A 1225 bp FnuDII fragment of HBV DNA from pRIT10616 (See, Harford et al. (1983) Devel. Biol. Standard, 54,
125-130) and encoding the HBsAg gene was excised from this plasmid and ligated with synthetic octamer
EcoRI linkers, and the fragment cloned into the EcoRI site of vector pACYC184 to give pRIT10679 (Figure 8).
Vector pACYC184 was received from S. Cohen and is described by Chang A.C.Y. and Cohen S. (1978), J.
Bacteriol., 134, 1141-1156. pACYC184 is also available from the American Type Culture Collection, Rockville,
Maryland, U.S.A., under accession number ATCC 37033. From this EcoRI (FnuDII) fragment, a 125 bp
EcoRI-XbaI fragment was obtained which contains the C-terminal region of the HBsAg precursor, the HBsAg
ATG codon and 90 bp of HBsAg coding sequence. This 125 bp EcoRI-XbaI fragment was introduced between
the EcoRI and XbaI sites of pRIT10158 (see, Example 4 (b), above). The resulting plasmid, pRIT10903,
therefore contains a single EcoRI site located at the junction between arg 3 DNA and HBV DNA and has a
single XhoI site located in the arg 3 promoter region. Figure 8 is a flow sheet which illustrates preparation of
pRIT10903.

150 µg of pRIT10903 plasmid DNA, prepared by CsCl-ethidium bromide density gradient centrifugation, was
digested with 80 units of EcoRI endonuclease, extracted with phenol, ethanol precipitated and resuspended
in 10 mM tromethamine-HCl buffer (pH 7.5). The DNA was divided into three samples, which were
incubated for 50, 75 and 90 seconds with Bal31 nuclease at 30°. Each reaction mixture contained the same
reaction buffer cited in Example 1, 50 µg of EcoRI digested pRIT10903 DNA, and 2.5 units of Bal31 nuclease in
a final volume of 500 µl. The reactions were stopped by addition of EGTA to 20 mM final concentration, chilling
on ice, extraction with an equal volume of phenol and precipitation with ethanol. The Bal31 treated DNA
samples were each taken up in 50 µl 10 mM tromethamine-HCl buffer and 2.5 µg digested with BstEII
endonuclease and compared on a 7.5% acrylamide gel to an EcoRI plus BstEII endonuclease digest of
pRIT10903 which releases a [...] bp fragment. Treatment with Bal31 nuclease for 75 seconds was found
to reduce the size of this EcoRI-BstEII fragment by 10 to 60 base pairs giving a series of discrete bands
estimated to represent removal of [...] and 60 base pairs of DNA. Resection of 29 base pairs from the
original FnuDII site would remove all the HBsAg precursor DNA and the HBsAg ATG codon.

5 µg of pRIT10903 DNA which had been treated with Bal31 for 75 seconds was digested with 8 units of XhoI
nuclease, extracted with phenol and ethanol precipitated. This DNA was then incubated with 5 units of T4 DNA
polymerase in the presence of deoxynucleotide triphosphates to fill in and render flush the XhoI and
Bal31 extremities, treated with phenol and ethanol precipitated. The DNA was then incubated with [...] T4
DNA ligase for 16 hours at 16° and the mixture used to transform competent cells of E. coli K12 strain MM294
to ampicillin resistance according to the method of Cohen et al., cited above. About 2000 ampicillin resistant
colonies [...] plating. Plasmids were prepared from 47 individual colonies by a small
scale procedure (See, Birnboim et al., cited above), and 23 of 47 were found to retain a XhoI site, indicative of
successful filling in of the original XhoI site and ligation to a 3' end terminating in a G residue. These 23
plasmids were further digested with XhoI and XbaI endonucleases to determine the size of the small restriction
fragment in comparison with the original EcoRI-XbaI fragment from pRIT10903.

Plasmids were identified with deletions estimated to range from 25 to 40 base pairs. One of these,
pRIT10914, which was estimated to contain a XhoI-XbaI fragment of 95 to 100 base pairs, was selected for
further study. Figure 8 is a flow sheet which illustrates preparation of pRIT10914. DNA of this plasmid was
purified by CsCl-ethidium bromide density gradient centrifugation and 25 µg of such DNA was digested with
XbaI endonuclease. The XbaI extremities were exchange labelled with γ-32P-ATP, and the labelled DNA was
digested with 15 units of HindIII endonuclease. The small 765 bp HindIII-XbaI fragment was isolated by
acrylamide gel electrophoresis [...] determined by sequencing this fragment from the labelled XbaI terminus
using the chemical modification methods of Maxam and Gilbert, cited above. The sequence data showed that
the XhoI site was located at the first base of the second codon of the HBsAg coding sequence.



XhoI

CTCGAGAAC
HBsAg nucleotides



This fragment is therefore suitable for fusion to the BamHI extension of the TDH3 promoter fragment on
pRIT10167 (Example 1) after removal of single stranded ends.

Example 7. Construction of plasmids pRIT12211, pRIT12209, pRIT12230 and pRIT12265

Plasmid pRIT10911, prepared as described in Example 5, was used as a scaffold for further manipulations.
To [...] sites, the plasmid was digested with BamHI and XbaI endonucleases and
the excised fragment replaced by a 2275 bp BamHI-XbaI fragment from plasmid pRIT10158 (Example 4). This
manipulation results in plasmid pRIT12211 in which the HindIII-BamHI TDH3 promoter fragment is followed by
the BamHI-XbaI fragment containing a XhoI site followed by an XbaI-HindIII fragment comprising the C-terminal
region of the HBsAg coding sequence and 128 bp of 3' non coding DNA fused to the 1150 bp arg 3
transcription termination region. Figure 8 is a flow sheet which illustrates preparation of pRIT12211.
pRIT12211 was digested with XhoI and XbaI endonucleases thereby removing a 1600 bp fragment which was
replaced by the 94 bp XhoI-XbaI fragment of modified HBsAg DNA from pRIT10914, prepared as described in
Example 6. This 94 bp fragment was purified by acrylamide gel electrophoresis from a XhoI plus XbaI digest of
pRIT10914 DNA followed by electroelution of the appropriate gel slice and recovery of the DNA by ethanol
precipitation.

The plasmid, pRIT12209, resulting from the introduction of the 94 bp fragment (See Figure 8) was purified
by CsCl-ethidium bromide density centrifugation, and 50 µg of pRIT12209 DNA was digested with [...] units of
BamHI endonuclease and 60 units of XhoI endonuclease to remove the 990 bp of [...]
the TDH3 promoter and the HBsAg coding region, and then was phenol extracted and ethanol precipitated. [...]
µg of this DNA was incubated for 30 minutes at 30° with 20 units of Mung Bean nuclease (PL Biochemicals) in
a volume of 125 µl. The buffer used for the incubation was 30 mM sodium acetate (pH 4.6), 250 mM NaCl, 1 mM
ZnCl2 and 5% glycerol. The reaction was stopped by the addition of sodium dodecyl sulphate to give 0.2%
final concentration and extraction with an equal volume of phenol followed by ethanol precipitation. An aliquot
of 0.5 µg of this Mung Bean nuclease treated sample was incubated with 2 units of T4 DNA ligase for 16 hours
at 16° and then digested with 2 units of BamHI endonuclease to destroy any vector molecules which had
escaped the above treatments. This mixture was used to transform competent cells of E. coli K12 strain
MM294 with selection for ampicillin resistance according to the method of Cohen et al., cited above. About
2000 ampicillin resistant colonies per ml of transformation mixture were recovered and plasmids were
prepared by a small scale procedure (according to the method of Birnboim et al., cited above) from amplified
cultures of 24 of these transformants.

16 of 24 plasmids gave two bands of DNA with sizes of 2700 bp (pUC9 vector) and 3000 bp after digestion
with HindIII endonuclease. The 3000 bp band is the size expected for the fusion of the HBsAg coding sequence
and arg 3 termination region to the TDH3 promoter fragment.

Four candidate plasmids were taken for further study. DNA of each plasmid was digested with XbaI
endonuclease and labelled by exchange kination with γ-32P-ATP followed by digestion with HindIII
endonuclease and isolation of the labelled 1190 bp HindIII-XbaI fragment by electroelution and ethanol
precipitation following acrylamide gel electrophoresis. The sequence of each fragment was determined by the
chemical modification methods of Maxam and Gilbert, cited above. Two plasmids were found to have the
correct sequence ATGGAGAAC for perfect fusion of the TDH3 promoter region to the HBsAg gene. One of
these plasmids, pRIT12230, was used in further manipulations. Figure 8 is a flow sheet which illustrates
preparation of pRIT12230. DNA of this plasmid (pRIT12230) was digested with HindIII endonuclease and the
3000 bp fragment containing the HBsAg coding sequence fused to the TDH3 promoter was ligated onto
shuttle vector YEp13. The resulting plasmid, pRIT12265, was transformed into yeast strain DC5 (MATa, leu2-3,
leu2-112, his3, can1-11) (obtained from J. Broach (State University of N.Y. Stony Brook) and available without
restriction from the American Type Culture Collection under accession number ATCC 20630) using the
transformation procedure of Ito et al., J. Bact., 153, 163 (1983).
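
The junction sequence ATGGAGAAC reported above can be read in frame from the ATG to confirm that the initiation codon is followed directly by the next two codons of the coding sequence. The following sketch uses the standard genetic code; the helper names are illustrative only.

```python
BASES = "TCAG"
# Standard genetic code, listed in TCAG order for first, second and third base.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    first + second + third: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, first in enumerate(BASES)
    for j, second in enumerate(BASES)
    for k, third in enumerate(BASES)
}


def translate(dna):
    """Translate a DNA sequence, read in frame from its first base."""
    if len(dna) % 3 != 0:
        raise ValueError("sequence length must be a multiple of three")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


if __name__ == "__main__":
    junction = "ATGGAGAAC"  # fusion junction sequence reported in the text
    print(junction, "->", translate(junction))
```

The three codons read Met-Glu-Asn, so no additional residues are introduced between the initiation codon supplied by the TDH3 promoter fragment and the HBsAg coding sequence.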


Example 8. Construction of plasmids pRIT12288, pRIT12322

The construction on plasmid pRIT12230 (Example 7) contains 128 bp non coding HBV DNA located 3'
downstream of the stop codon of the HBsAg structural gene.

To remove the 128 bp of DNA, about 25 µg of pRIT12230, prepared as described in Example 7 above, is
digested with AccI and EcoRI endonucleases. pRIT12230 contains a single AccI site located in the S-gene
coding sequence 7 nucleotides before the TAA stop codon. The digested DNA was electrophoresed on a 1%
agarose gel and the larger 4200 bp vector fragment was recovered from the gel by electroelution and ethanol
precipitation.

Artificial DNA linker molecules composed of a 12-mer strand and a 14-mer strand with the following
sequences are synthesized for insertion between the AccI and EcoRI sites of pRIT12230:

5' ATACATTTAACG 3'        12 mer
3'   TGTAAATTGCTTAA 5'    14 mer

This will restore the correct C-terminal HBsAg codons, the TAA stop codon and provide an EcoRI site not
present elsewhere in the TDH3 promoter or HBsAg coding sequences for addition of transcription termination
fragments.
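
The design of the linker can be checked by annealing the two strands computationally. The sketch below assumes that AccI cleavage at GTATAC leaves a two base 5' extension (AT) and that EcoRI leaves a four base 5' extension (AATT); the reading frame used in the final check is likewise an assumption, and the function names are illustrative only.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")

TOP_12MER = "ATACATTTAACG"                 # written 5'->3', as in the text
BOTTOM_14MER = "TGTAAATTGCTTAA"[::-1]      # text prints it 3'->5'; reversed here to 5'->3'


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence written 5'->3'."""
    return seq.translate(COMPLEMENT)[::-1]


def anneal(top, bottom):
    """Return (top 5' extension, duplex region on the top strand, bottom 5' extension).

    Assumes the top strand protrudes at its 5' end and the bottom strand
    protrudes beyond the 3' end of the top strand, as in the linker above.
    """
    bottom_as_top = reverse_complement(bottom)
    for offset in range(len(top)):
        duplex = top[offset:]
        if bottom_as_top.startswith(duplex):
            bottom_extension = reverse_complement(bottom_as_top[len(duplex):])
            return top[:offset], duplex, bottom_extension
    raise ValueError("strands do not anneal in the assumed arrangement")


if __name__ == "__main__":
    top_ext, duplex, bottom_ext = anneal(TOP_12MER, BOTTOM_14MER)
    print("top 5' extension:", top_ext)
    print("duplex region:", duplex)
    print("bottom 5' extension:", bottom_ext)
    # Top strand after ligation between the AccI (..GT) and EcoRI (AATTC..) ends.
    restored = "GT" + TOP_12MER + "AATTC"
    print("restored top strand:", restored)
```

Under these assumptions the annealed linker carries one extension compatible with the AccI end and one compatible with the EcoRI end, and the ligated top strand contains the AccI site, the TAA stop codon and a regenerated EcoRI site.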

100 pmoles of the synthetic 12 mer and 100 pmoles of the synthetic 14 mer single strands are mixed
together in a final volume of 10-20 µl, heated at 70° for 15 minutes, and allowed to return slowly to room
temperature over a period of 3 hr permitting the annealing of the two strands along their complementary
sequences. To this annealed mixture is added about 0.15 pmole (400 ng) of the 4200 bp AccI-EcoRI fragment
of pRIT12230, 10x concentrated ligation buffer and 2 units of T4 DNA ligase.
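
The correspondence between 400 ng and about 0.15 pmole of the 4200 bp fragment, and the resulting molar excess of linker over vector, can be verified as follows. The sketch assumes an average mass of 660 g/mol per base pair of double stranded DNA, a commonly used approximation that is not stated in the text.

```python
AVG_BP_MASS_G_PER_MOL = 660.0  # assumed average mass of one base pair of duplex DNA


def ng_to_pmol(mass_ng, length_bp):
    """Convert a mass of double stranded DNA (ng) to picomoles of molecules."""
    grams = mass_ng * 1e-9
    moles = grams / (length_bp * AVG_BP_MASS_G_PER_MOL)
    return moles * 1e12


def molar_excess(linker_pmol, vector_pmol):
    """Ratio of linker molecules to vector molecules in the ligation."""
    return linker_pmol / vector_pmol


if __name__ == "__main__":
    vector_pmol = ng_to_pmol(400, 4200)
    print(f"vector fragment: {vector_pmol:.3f} pmol")
    print(f"linker excess: about {molar_excess(100, vector_pmol):.0f}-fold")
```

The calculation gives a value close to the 0.15 pmole stated above and shows that the annealed linker is present in a molar excess of several hundred fold over the vector fragment.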

This mixture is incubated for 4 hours at 16° before the addition of a further 2 units of T4 DNA ligase and then
incubated overnight on ice. The ligated mixture is used to transform competent cells of strain MM294 to
ampicillin resistance according to the method of Cohen et al. (Proc. Natl. Acad. Sci., U.S.A., 69, 2110, 1972).
Plasmids are prepared from 12 or more individual colonies by the rapid procedure of Birnboim and Doly (Nucl.
Acids Res., 7, 1513, 1979) and examined by restriction endonuclease analysis. Plasmids with insertion of the
AccI to EcoRI linker will show a single band of DNA of 4200 bp on digestion with either AccI or EcoRI enzyme.
The correctness of the linker insertion on such a plasmid can be verified by DNA sequencing from either the
EcoRI site or AccI site by the chemical modification method of Maxam and Gilbert (Methods in Enzymology,
65, 499, 1980).

To provide a transcription termination fragment, a plasmid with the AccI-EcoRI linker insert, obtained as
described above, is digested with EcoRI endonuclease and treated with alkaline phosphatase (Shine J., U.S.
Patent 4,264,731). The 2650 bp HindIII fragment of pRIT10162, obtained as described in Example 3, was
recloned in the HindIII site of the pBR327 vector plasmid to form plasmid pRIT12288. pBR327 is described in
Soberon et al. (Gene, 9, 287-305, 1980), and can be prepared as described in Example 10, part b, infra, from
plasmid pRIT12309 which can be recovered from the American Type Culture Collection under accession
number ATCC 67187. Figure 8 is a flow sheet which discloses construction of pRIT12288.

The orientation of the HindIII fragment on pRIT12288 is such as to juxtapose the arg 3 transcription
termination region with the EcoRI and ClaI restriction sites on the pBR327 moiety. From pRIT12288, a 1180 bp
EcoRI fragment is obtained which carries the arg 3 transcription termination region. This fragment is ligated to
the EcoRI digested, alkaline phosphatase treated pRIT12230 derivative described above, and plasmids issuing
from the ligation mixture are examined for insertion and the orientation of the 1180 bp EcoRI fragment. The
orientation of the insert can be obtained by digestion with HindIII endonuclease which will liberate fragments of
2880 and 2720 bp if the insert orientation is as desired.

A plasmid, pRIT12322, was constructed in which the AccI-EcoRI linker replaces the 128 bp of unwanted HBV
DNA and on which the 1180 bp EcoRI fragment from pRIT12288 was present in the desired orientation,
downstream of the HBsAg coding sequence. A 2900 bp HindIII fragment can be excised from pRIT12322 which
carries the expression module for HBsAg synthesis including the TDH3 promoter, the HBsAg coding
sequence and the arg3 transcription termination region. Figure 8 is a flow sheet which discloses construction
of pRIT12322. See also Figure 11, which is a restriction endonuclease map of pRIT12322 and shows which
portions thereof come from pRIT12288 and pRIT12230.

Example 9. Construction of plasmid pRIT12329

As described in Example 8 above, pRIT12322 contains a 2900 bp HindIII fragment with all the necessary
signals for expression of HBsAg from the TDH3 promoter. This module was ligated onto a shuttle vector for
introduction into yeast cells.

The shuttle vector YEp13 was digested with HindIII endonuclease and alkaline phosphatase and used as
recipient for the introduction of the HindIII fragment from pRIT12322, prepared as described above.
Plasmid pRIT12329 was isolated, which consists of the YEp13 replicon together with the 2900 bp HindIII module
fragment described in Example 8.

Example 10. Construction of plasmid vectors for the expression of the hybrid antigen HBsAg-CS in yeast

(A) Construction of Plasmids pRIT12573 and pRIT12574

Plasmid WR201 was obtained from the Walter Reed Army Institute of Research and results from insertion of
a 2.3 kilobase (kb) EcoRI fragment from λmPf1 (Dame et al., Science, 225, 593-599, 1984) encoding the
complete circumsporozoite protein gene of Plasmodium falciparum into vector pUC8. Figure JA shows a
schematic restriction map and sequencing strategy of clone λmPf1. The positions of restriction enzyme
cleavage sites shown in the figure were determined from the sequence and confirmed by digestion: A, AvaII;
Ac, AccI; B, BstNI; D, DraI; Dd, DdeI; F, FokI; N, NdeI; R, RsaI; S, StuI; T, Tth111I; Tq, TaqI; X, XhoII. Arrows
indicate the origin, direction and extent of the sequences determined. The CS protein coding region is shown
as a heavy line. Figure 3A shows the nucleotide sequence of the CS protein gene from P. falciparum. The
nucleotide sequence of the CS protein gene in λmPf1 is shown. Vector pUC8 is described by Vieira et al. (Gene,
19, 259, 1982). pUC8 is commercially available from Amersham and Pharmacia. A 1215 base pair (bp) StuI-RsaI
fragment of plasmid WR201 containing the CS protein coding sequence minus the first 52 bp (Figure 12) was
purified by electroelution from a 7.5% polyacrylamide electrophoresis gel. This fragment was digested with
Sau3A to generate smaller fragments which included, among others, a 192 bp Sau3A fragment coding for 16
tetrapeptide repeats of the CS protein and three identical 24 bp Sau3A fragments, each coding for 2 tetrapeptide
repeats of the CS protein.
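
The repeat counts quoted above follow directly from the fragment lengths, since one tetrapeptide is encoded by four codons (12 bp). The following is a minimal sketch of that arithmetic, assuming the Sau3A fragments are in frame over their whole length; the function name is illustrative and does not come from the specification.

```python
CODON_BP = 3          # base pairs per codon
TETRAPEPTIDE_AA = 4   # amino acids per CS tetrapeptide repeat


def tetrapeptide_repeats(fragment_bp: int) -> int:
    """Return the number of whole tetrapeptide repeats encoded by an in-frame fragment."""
    bp_per_repeat = CODON_BP * TETRAPEPTIDE_AA  # 12 bp per repeat
    if fragment_bp % bp_per_repeat != 0:
        raise ValueError("fragment length is not a whole number of repeats")
    return fragment_bp // bp_per_repeat


if __name__ == "__main__":
    for size in (192, 24):
        print(f"{size} bp Sau3A fragment: {tetrapeptide_repeats(size)} repeats")
```

The check raises an error for any length that is not a multiple of 12 bp, which makes a transcription error in a fragment size easy to spot.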

Plasmid pRIT10911 (prepared as described in Example 5) is a pUC9 derivative carrying a 3138 bp HindIII
cassette containing a 1050 bp HindIII-BamHI yeast glyceraldehyde-3-phosphate dehydrogenase (TDH3) promoter
fragment (which supplies the ATG) fused at the BamHI site to a 935 bp BamHI-HpaI fragment of HBV DNA
encoding the C-terminal 42 amino acids (aa) of the pre S2 region, 226 aa of the S gene (serotype adw) and
128 bp of 3' non-coding DNA. The S gene is flanked by a 1150 bp yeast DNA fragment carrying the arg3
transcription terminator. pUC9 is described by Vieira et al. (Gene, 19, 259-268, 1982). The BamHI site of
pRIT10911, located at the ATG initiation codon, is in phase with the Sau3A sites of the repetitive epitope of CS
of Plasmodium falciparum. DNA of plasmid pRIT10911 was digested with BamHI endonuclease, treated with
alkaline phosphatase, extracted with phenol and recovered by ethanol precipitation. 0.3 µg of this DNA was
mixed with 0.2 µg of the Sau3A digest of the StuI-RsaI CS gene fragment prepared as described above, treated
with T4 DNA ligase, and the ligation mixture was used to transform competent cells of E. coli K12 strain MM294
prepared according to the method of Cohen et al. (Proc. Natl. Acad. Sci., U.S.A., 69, 2110, 1972). 32 plasmids
from individual transformant colonies were prepared by a small scale procedure (Birnboim et al., Nucleic Acids
Research, 7, 1513, 1979) after amplification of the plasmid by addition of spectinomycin (300 µg per ml) to the
growing cultures. The recombinant plasmids were analysed on 7.5% acrylamide electrophoresis gel, after
double digestion with BamHI and XbaI endonucleases, and were compared to the 219 bp BamHI-XbaI
fragment of pRIT10911 (comprising the first 42 amino acid coding sequence of the Pre S2 region precursor
and the beginning of HBsAg region) and to the fragments of a HaeIII digest of φX174 DNA.

Among the 32 plasmids, one recombinant showed a BamHI-XbaI fragment of 411 bp which corresponds to
the insertion of the 192 bp Sau3A fragment in the correct orientation and was designated pRIT12572. Another
recombinant showed a BamHI-XbaI fragment of 243 bp corresponding to the insertion of the 24 bp Sau3A
fragment in the correct orientation and was designated pRIT12571. Figure 12 is a flow sheet illustrating
preparation of pRIT12571 and pRIT12572.

The HindIII fragments of pRIT12572 and pRIT12571, containing the TDH3 promoter, the CS-HBsAg hybrid
coding sequence and the arg3 termination region, were recloned onto YEp13 to give, respectively, plasmids
pRIT12574 and pRIT12573 (Figure 12). YEp13 is described by Broach et al. (Gene, 8, 121-133, 1979). Figure 12
is a flow sheet illustrating preparation of pRIT12573 and pRIT12574.

Plasmids pRIT10912, pRIT12574 and pRIT12573 were introduced into S. cerevisiae yeast strain DC5 (a,
leu2-3, leu2-112, his3, can1-11), and into S. cerevisiae yeast strain 10S44C (pep4-3, leu2-3, leu2-112) obtained
from M. Crabeel located at the Ceria Institute, Anderlecht, Belgium (see also Cabezon, T. et al., Proc. Natl.
Acad. Sci. U.S.A., 81, 6594-6598, 1984), by transformation of lithium acetate treated cells (Ito et al., J. Bacteriol.,
153, 163-168, 1983) and selection for leucine independence. Yeast strain DC5 was deposited in the American
Type Culture Collection, Rockville, Maryland, U.S.A., in accordance with the regulations of the Budapest Treaty
on June 2, 1982, under accession number ATCC 20630. Yeast strain 10S44C was deposited in the American
Type Culture Collection, Rockville, Maryland, U.S.A., in accordance with the regulations of the Budapest Treaty
on August 18, 1986, under accession number ATCC 20819. The hybrid protein expressed from pRIT12573 is
predicted to contain 277 amino acids, and that from pRIT12574 is predicted to contain 333 amino acids.
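
The two predicted lengths can be cross-checked against the insert sizes: pRIT12574 carries the 192 bp Sau3A insert and pRIT12573 the 24 bp insert, so the proteins should differ by (192 - 24) / 3 residues. A minimal sketch of that consistency check follows; the helper name and the dictionary are illustrative only, and the sketch assumes both inserts are read in frame.

```python
# Predicted hybrid protein lengths quoted in the text (amino acids).
PREDICTED_AA = {"pRIT12573": 277, "pRIT12574": 333}

# Sau3A insert carried by each plasmid (bp), as described in part (A).
INSERT_BP = {"pRIT12573": 24, "pRIT12574": 192}


def extra_residues(long_insert_bp: int, short_insert_bp: int) -> int:
    """Residues gained when the longer in-frame insert replaces the shorter one."""
    diff_bp = long_insert_bp - short_insert_bp
    if diff_bp % 3 != 0:
        raise ValueError("insert sizes differ by a non-integral number of codons")
    return diff_bp // 3


if __name__ == "__main__":
    gained = extra_residues(INSERT_BP["pRIT12574"], INSERT_BP["pRIT12573"])
    observed = PREDICTED_AA["pRIT12574"] - PREDICTED_AA["pRIT12573"]
    print(f"expected difference: {gained} aa, quoted difference: {observed} aa")
```

Both figures come out at 56 residues, so the quoted lengths agree with the fragment sizes.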

It is also possible to add to or enlarge the CS epitope region on the vector of the invention. For example, on
plasmids pRIT12572 and pRIT12571, additional Sau3A fragments of CS protein DNA, e.g., 192 bp or 24 bp
Sau3A fragments, can be ligated into the BamHI site as follows:

Plasmid DNA of pRIT12572 or pRIT12571 is digested with BamHI endonuclease and treated with alkaline
phosphatase. The 1215 bp StuI-RsaI fragment of plasmid WR201, obtained as described above, is digested
with Sau3A endonuclease to liberate the 192 bp and 24 bp fragments bearing the tetrapeptide repeats. These
fragments are mixed with the BamHI and alkaline phosphatase treated pRIT12572 or pRIT12571 vectors,
treated with T4 DNA ligase and the mixtures transformed into E. coli K12 cells according to conventional
procedures with selection for ampicillin resistance. Plasmids from transformant colonies are screened for the
size of the BamHI-XbaI fragments by endonuclease digestion and compared to similarly digested DNA of
pRIT12572 or pRIT12571, especially the 411 bp and 243 bp BamHI-XbaI fragments from pRIT12572 and
pRIT12571 which carry the fusion between the CS repeats and part of the pre S2-S coding sequence. An increase
in the size of these fragments by 192 bp or 24 bp is indicative of the insertion of a tetrapeptide repeat fragment
at the BamHI site of pRIT12572 and pRIT12571. The presence of a BamHI site on such plasmids is indicative of
insertion of the 192 bp or 24 bp Sau3A fragments in the correct orientation for fusion at the ATG codon. The
process described above may be repeated if desirable to introduce further 192 bp or 24 bp Sau3A fragments
encoding the CS tetrapeptide repeats and elongate the CS tetrapeptide region on the hybrid particle. As
stated before, the HindIII fragments from such derivatives of pRIT12572 or pRIT12571 carrying the expression
module or cassettes may be excised and inserted on YEp13 or other suitable yeast cloning vector for
introduction into yeast cells by conventional techniques.
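
The size screen described above can be tabulated in advance. The sketch below lists the BamHI-XbaI fragment sizes to look for after one or more further rounds of insertion, starting from the 219 bp fragment of pRIT10911. It assumes exactly one Sau3A fragment is inserted per round (a ligation could in practice insert several at once), and the function names are hypothetical.

```python
BASE_FRAGMENT_BP = 219          # BamHI-XbaI fragment of pRIT10911 (from the text)
SAU3A_INSERTS_BP = (192, 24)    # repeat-bearing Sau3A fragments (from the text)


def bam_xba_size(inserts_bp):
    """Expected BamHI-XbaI fragment size after the listed inserts have been added."""
    return BASE_FRAGMENT_BP + sum(inserts_bp)


def candidate_sizes(start_bp, rounds=1, insert_sizes=SAU3A_INSERTS_BP):
    """Fragment sizes reachable from start_bp after `rounds` single insertions."""
    sizes = {start_bp}
    for _ in range(rounds):
        sizes = {size + insert for size in sizes for insert in insert_sizes}
    return sorted(sizes)


if __name__ == "__main__":
    print("pRIT12572:", bam_xba_size([192]), "bp")
    print("pRIT12571:", bam_xba_size([24]), "bp")
    print("one more round from pRIT12572:", candidate_sizes(411))
    print("one more round from pRIT12571:", candidate_sizes(243))
```

Size alone does not establish orientation; as the text notes, retention of a BamHI site at the ATG codon is the orientation check.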

(B) Construction of pRIT12309, pRIT12314 and an expression shuttle vector pRIT12377 containing the 
complete 2 micron DNA sequence from yeast 

The plasmid pRIT12309 consists of the complete 6318 bp 2 micron DNA sequence from yeast cloned into
the EcoRI site of the plasmid vector pBR327. The 6318 bp 2 micron DNA sequence was obtained from plasmid
pCV19 from J. Broach, Princeton University, NJ. Plasmid pRIT12309 was deposited on August 18, 1986 in the
American Type Culture Collection, Rockville, Maryland, U.S.A., in E. coli K12 strain MM294 in accordance with
the regulations of the Budapest Treaty under accession number ATCC 67187. Figure 13 is a restriction
endonuclease map of pRIT12309. Insertion on pBR327 of the 2 micron DNA sequence through one of the two
EcoRI sites present on this molecule interrupts the 2 micron A coding region and renders the resulting hybrid
molecule pRIT12309 ineffective for A gene function, also referred to as FLP function. For a description of 2
micron structure and function see Broach J., "The Molecular Biology of the Yeast Saccharomyces: Life Cycle
and Inheritance", pp. 445-470, Strathern J.N., Jones E.W. and Broach J.R. (Eds), Cold Spring Harbor
Laboratory, New York (1981).

pBR327 is described by Soberon et al. (Gene, 9, 287-305, 1980) and was obtained from F. Bolivar
(Department of Molecular Biology, National University of Mexico, Mexico 20 DF). It will be appreciated by one
skilled in the art that pBR327 can be recovered from plasmid pRIT12309 above by preparing pRIT12309
plasmid DNA recovered from the E. coli strain deposited in the American Type Culture Collection under the
accession ATCC 67187 by any of a number of conventional methods, digesting this DNA with EcoRI
endonuclease, ligating the digest and transforming competent cells of E. coli K12 to ampicillin or tetracycline
resistance with the ligation mixture. A substantial number of the transformant colonies will contain plasmids in
which only the pBR327 moiety of pRIT12309 is present, devoid of either of the two 2 micron DNA fragments.

The 2850 bp ClaI-SalI fragment containing the TDH3-arg3 expression cassette and flanking pBR322
sequences from pRIT12290 (described in Example 3, above) was recloned onto the pRIT12309 vector between
the ClaI and SalI sites. The resulting plasmid, pRIT12314, was digested with SalI endonuclease and ligated with
the 2218 bp SalI-XhoI fragment from YEp13 containing the yeast LEU2 gene. Figure 14 is a flow sheet
illustrating preparation of pRIT12314. YEp13 is available from the American Type Culture Collection under
accession number ATCC 37115. The ligation mixture was used to transform competent cells of strain JA221
prepared according to the method of Cohen et al. (above) to both leucine independence and ampicillin
resistance.

Strain JA221 is publicly available, and was obtained from M. Rosenberg, Smith Kline & French Laboratories,
Inc., Philadelphia, and may be obtained from the American Type Culture Collection under accession number
ATCC 33875.

A plasmid, pRIT12544, was obtained from a colony from this transformation in which the 2218 bp SalI-XhoI
LEU2 gene fragment was inserted at the SalI site of pRIT12314. The orientation of the insertion is such as to
recreate the SalI site on the side closest to the TDH3-arg3 expression cassette. Plasmid pRIT12544 is an E.
coli - yeast shuttle vector with single BamHI and SmaI sites located between the TDH3 promoter and arg3
transcription termination regions which are suitable for insertion of DNA coding sequences to effect their
expression from the TDH3 promoter. Figure 14 is a flow sheet illustrating preparation of pRIT12544.

(C) Construction of Plasmid pRIT12658
The 3328 bp HindIII fragment of pRIT12574, a YEp13 derivative prepared as described in Example 10, Part A,
was recloned onto a pBR322 ΔBamHI plasmid (where the BamHI site has been destroyed by filling in with T4
DNA polymerase) to give pRIT12608, where the BamHI site located at the ATG initiation codon of the
CS-HBsAg coding sequence is unique (Figure 15). The 2550 bp BamHI-SalI fragment from pRIT12608
containing the CS-HBsAg coding sequence, the arg3 transcription termination region and flanking pBR322
sequence was recloned between the BamHI and SalI sites of pRIT12544 (Figure 15), described above, to give
plasmid pRIT12658 (Figure 15). This exchange placed the CS-HBsAg coding sequences under the control of
the TDH3 promoter on a complete 2 micron yeast shuttle vector. Figure 15 is a flow sheet illustrating
preparation of pRIT12658.

Example 11. Synthesis of hybrid protein in yeast
Yeast (Saccharomyces cerevisiae) strains DC5 or 10S44C were transformed according to the method of Ito
et al. (J. Bacteriol., 153, 163-168, 1983) with each of the plasmids pRIT10912 (Example 5), pRIT12573 (Example
10), pRIT12574 (Example 10) or pRIT12658 (Example 10), or plasmids pRIT12329 (Example 9) and pRIT10172
(Example 2), and were separately grown in liquid medium lacking leucine (YNB or YNB + 80 µg/ml histidine)
and harvested in mid-log phase. Cells from 1 or 2 ml culture were collected by centrifugation and were
resuspended in 50 µl of 0.125 M Tris-HCl (pH 6.8) containing 20% glycerol, 4% SDS, 6 M urea and 10%
2-mercaptoethanol. After incubation for 5 minutes (min.) at 100°C, the sample was divided in halves and both
were electrophoresed through a 12.5% separating gel, 5% stacking gel according to the method of Laemmli
(Nature, 227, 680, 1971).

HBsAg and CS protein sequences were identified by protein immunoblotting technique (see Burnette,
Anal. Biochem., 112, 195-203, 1981; Gershoni and Palade, Anal. Biochem., 131, 1-15, 1983; Towbin and
Gordon, J. Immunol. Methods, 72, 313-340, 1984). After electroblotting of the proteins onto a nitrocellulose
filter (Schleicher and Schüll, 0.45 µ) (Towbin et al., Proc. Natl. Acad. Sci., U.S.A., 76, 4350-4354, 1979) the filter
was preincubated for one hour at 37°C with 3% gelatin in PBS. Subsequently the filter was washed 5 times
for 5 min. each time with PBS containing 0.1% Tween 20 (polyoxyethylene sorbitan monolaurate), and one-half
of the sheet was treated for one hour at room temperature with a mixture of five monoclonal antibodies against
CS protein (Dame et al., Science, 225, 593-599, 1984) in PBS, 1% gelatin. The other half of the sheet was
treated with a monoclonal antibody against HBsAg, HepYTAS12, in PBS, 1% gelatin. HepYTAS12 is an
antibody raised against purified yeast derived HBsAg which is directed against an epitope resistant to reduction
and denaturation. The filter sheets were washed 5 times for 5 min. each time with PBS, 0.1% Tween 20, and a
second antibody, biotinylated sheep anti-mouse antibody (Amersham) in PBS, 1% gelatin, was added for one hour at
room temperature. The sheets were washed again with PBS, 0.1% Tween 20, 5 times for 5 min. each time, and
were incubated for 30 min. at room temperature with streptavidin-biotinylated horseradish peroxidase (HRP)
complex (Amersham). The sheets were washed again 3 times for 5 min. each time with PBS, 0.1% Tween 20,
and were finally incubated with 30 µl H2O2 and HRP Color Development Reagent containing 4-chloro-1-naphthol
(Bio-Rad), 30 mg in 10 ml methanol, in 50 ml PBS. The molecular weight of the antigens was estimated from
prestained protein markers: ovalbumin, alpha-chymotrypsinogen, beta-lactoglobulin, lysozyme, cytochrome C
(BRL).

Both yeast strains DC5 and 10S44C, transformed with pRIT12573, showed production of an antigen reacting
with both anti-CS and anti-HBsAg antibodies, and of an estimated molecular weight of 30 kilodaltons (kd).
Both yeast strains DC5 and 10S44C, transformed with either pRIT12574 or pRIT12658, showed production of an
antigen capable of reacting with both anti-CS and anti-HBsAg antibodies, and of an estimated molecular
weight of 37 kd. Some lower molecular weight antigen was also detected, indicative of some degradation
of the hybrid protein. Control samples of cells containing plasmids without any CS or HBsAg sequences
(pRIT10172) did not show any reaction. Control samples of cells containing plasmids with only HBsAg
sequences (pRIT12329 or pRIT10912) showed reaction only with the anti-HBsAg antibodies.

These results show that a hybrid protein of the expected molecular weight containing both CS and HBsAg
sequences is synthesized in yeast transformed with a plasmid containing the CS coding sequence fused in
phase to the PreS2-HBsAg coding sequence.
Furthermore, while the band intensity in the anti-HBsAg reaction was about the same for pRIT12573 and
pRIT12574, the intensity of the anti-CS reaction was much higher for pRIT12574 than for pRIT12573. This
indicates that for the same quantity of HBsAg sequences there are more sequences of the repetitive epitope
of the CS protein present in pRIT12574 than in pRIT12573.

Example 12. Synthesis of hybrid particles presenting the CSP epitope 

Yeast (Saccharomyces cerevisiae) strains DC5 or 10S44C containing either pRIT10172 (Example 2),
pRIT12329 (Example 9), pRIT12573 (Example 10) or pRIT12574 (Example 10) were grown in selective medium
(200 ml YNB or YNB + 80 µg/ml histidine) to end of log phase. Cells were collected by centrifugation and
resuspended in 10 ml of 50 mM sodium phosphate buffer (pH 7.4) containing 0.5% Tween 20 (polyoxyethylene
sorbitan monolaurate), 1 mM PMSF (phenylmethylsulfonyl fluoride) and 2.5% isopropanol. Cells were
disrupted by passage twice through a French Press at 20,000 psi (1.38 x 10^8 Pa). The suspension was
centrifuged for 30 min. at 30,000 x g. The total protein concentration of the supernatant liquid (crude cell
extract) was measured by the method of Lowry et al. (J. Biol. Chem., 193, 265-275, 1951) with bovine serum
albumin as the standard. The presence of HBsAg was assayed using a radioimmune assay kit (AUSRIA II)
commercially available from Abbott Laboratories, North Chicago, Illinois, or using an ELISA kit
(Enzygnost-HBsAg micro) commercially available from Behringwerke, Marburg, W. Germany. The availability of CS
sequences in the HBsAg particles to an immunoreaction was tested in the above-mentioned ELISA assay.
Appropriate dilutions of samples to be tested (in PBS containing O^Vb BSA) were allowed to bind (overnight at
room temperature) to the solid phase anti-HBsAg (Enzygnost kit). The bound HBsAg particles were incubated
for one hour at 37°C with a 1/10,000 dilution (in PBS containing 0.1% BSA) of a mixture of five monoclonal
antibodies directed against the CS protein (Dame et al., Science, 225, 593-599, 1984) as the first antibody, and
then incubated with a 1/500 dilution (in PBS containing 0.1% BSA) of biotinylated sheep anti-mouse Ig
(Amersham) as the second antibody for one hour at 37°C. Detection was by incubation with a 1/1000 dilution
(in PBS containing 0.1% BSA) of streptavidin-biotinylated horseradish peroxidase complex (Amersham) and
chromogen from the Enzygnost kit for 30 min. at 37°C. Between incubations the trays were washed four times
with washing solution from the Enzygnost kit. Color development was measured at 510 nm in a Titertek
Multiskan.

Protein concentration and AUSRIA activity (Table II) in crude cell extracts showed that
pRIT12573 directed HBsAg expression at the same level as pRIT12329. However, pRIT12574 showed about a
ten-fold lower AUSRIA activity. Immunoblot analysis of these extracts showed about an equal amount of
HBsAg related protein synthesized in those three cases.

Table II - AUSRIA activity in crude cell extracts

Plasmid      Gene       Protein    HBsAg (RIA)   HBsAg
                        (mg/ml)    (ng/ml)       (ng/mg protein)

pRIT10172               5.6        0             0
pRIT12329               3.3        5700          1730
pRIT12573    PreS2-S    3.6        4000          1110
pRIT12574    PreS2-S    2.8        450           160
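
The last column of Table II is the RIA value divided by the total protein concentration. A minimal sketch of that normalization is given below, using the values as read from the table (the 5.6 mg/ml entry is read from a garbled figure and is an assumption); the function and variable names are illustrative.

```python
# (protein mg/ml, HBsAg ng/ml by RIA) as read from Table II.
# The 5.6 mg/ml value for pRIT10172 is an assumed reading of a garbled entry.
TABLE_II = {
    "pRIT10172": (5.6, 0.0),
    "pRIT12329": (3.3, 5700.0),
    "pRIT12573": (3.6, 4000.0),
    "pRIT12574": (2.8, 450.0),
}


def specific_activity(protein_mg_per_ml: float, hbsag_ng_per_ml: float) -> float:
    """HBsAg detected by RIA, expressed in ng per mg of total protein."""
    return hbsag_ng_per_ml / protein_mg_per_ml


if __name__ == "__main__":
    for plasmid, (protein, hbsag) in TABLE_II.items():
        print(f"{plasmid}: {specific_activity(protein, hbsag):.0f} ng/mg protein")
    ratio = specific_activity(*TABLE_II["pRIT12329"]) / specific_activity(*TABLE_II["pRIT12574"])
    print(f"pRIT12329 / pRIT12574 specific activity ratio: {ratio:.1f}")
```

The computed values round to the tabulated 1730, 1110 and 160 ng/mg, and the ratio between pRIT12329 and pRIT12574 is close to the roughly ten-fold difference described in the text.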



The ELISA assay showed that a structure expressing both HBsAg and CS epitopes is present in pRIT12573
and pRIT12574 extracts (Table III) and that there are more CS epitopes per HBsAg epitope that react in
pRIT12574 extracts than in pRIT12573 extracts.

To examine the nature of these RIA and ELISA positive structures, one ml of the crude cell extracts was
mixed with 1.5 M CsCl in 50 mM sodium phosphate (pH 7.4) and centrifuged for 40 hours at 40,000 x g in a
Beckman SW50.1 rotor. Collected fractions were assayed for the presence of HBsAg and CS antibody binding
activity with the RIA and ELISA methods described above. HBsAg and CS antigenic activity was found to
coequilibrate in both pRIT12573 and pRIT12574 gradients. Their buoyant density was found to be slightly
higher (rho = 1.20 g/cm3) than the one for pRIT12329 (rho = 1.18 g/cm3).

Immunoblot analysis of the gradient fractions confirmed the RIA/ELISA results. HBsAg and/or CS
sequences were found only in those fractions that did react in the RIA/ELISA assay.

Table III - HBsAg and CS epitopes present in antigen bound to immobilized anti-HBsAg antibody

Crude cell             Dilution   ELISA (HBsAg)   ELISA (CS)
extract from:                     OD 510          OD 510

10S44C (pRIT10172)     100X       0.000           0.065

10S44C (pRIT12329)     100X       0.580           0.054
                       200X       0.341           0.041
                       400X       0.188           0.056

10S44C (pRIT12573)     100X       1.298           1.596
                       200X       1.080           1.299
                       400X       0.702           0.898
                       800X       0.336           0.560
                       1600X      0.155           0.323
                       3200X      0.075           0.189

10S44C (pRIT12574)     100X       0.662           1.678
                       200X       0.370           1.506
                       400X       0.183           1.130
                       800X       0.079           0.798
                       1600X      0.044           0.449
                       3200X      0.061           0.205

These results indicate that the hybrid protein produced by expression of pRIT12573 and pRIT12574
assembles into particles like the 22 nm HBsAg particles upon synthesis in yeast. The particles produced by
expression of pRIT12573 and pRIT12574 do expose the CS sequences on their exterior, as shown by the
capacity to react with anti-CS antibodies. In the case of pRIT12574, those CS sequences partially
shield the a determinant of HBsAg (the one mainly responsible for AUSRIA activity) in the RIA test.

Example 13. 

^, "1 I. Q r -P5K-3000 column (LKB) which contained both HBsAg and CS antigenicity.
al., Anal. Biochem., 118, 197-203, 1981) showed one major band of estimated molecular weight (MW) ^.uuo
kd. Electron microscopy after negative staining with uranyl acetate showed particles of about the same size
and shape as yeast derived HBsAg.

B. Purification of hybrid particles

Crude yeast extracts are prepared by Dyno Mill disruption of the washed yeast cells in the presence of an
equal weight of 50 mM sodium phosphate extraction buffer (pH 8.1) supplemented with 4 mM Titriplex III, 1%
Tween 20, 4 mM PMSF and 10% isopropanol.

Cell debris is removed by centrifugation at 11,000 x g for 90 min. Five hundred ml of centrifuged crude extract
are adjusted to pH 7.2 with 1 N acetic acid and stirred overnight at 4°C in the presence of 10 g colloidal silica
(Aerosil 380, Degussa). Subsequently, the suspension is centrifuged at 3,600 x g for 1 hour and the pellet is
washed three times, for 15 min. each, with 150 ml of 0.15 M NaCl supplemented with 2 mM PMSF and 5 mM EDTA.
Finally the Aerosil pellet is eluted with 250 ml 10 mM pyrophosphate buffer (pH 9.5) containing 2% Tween 20 at
37°C for 3 hours. To the desorbate is added dropwise a 1 M CaCl2 solution to a final concentration of 30 mM
CaCl2 and the pH is adjusted to 7. The solution is left overnight at 4°C and centrifuged at 6,500 x g for 15
minutes. The supernatant is dialysed against 2 x 5 l 10 mM Tris/HCl (pH 7.1) at 4°C and subsequently diluted
four times with dialysis buffer.

After addition of 30 mM CaCl2 and 1 M NaCl the diluted antigen solution, adjusted to pH 7.1, is passed over a
150 ml column of Phenyl Sepharose FF, equilibrated in the same CaCl2/NaCl containing Tris buffer, at a flow
rate of 30 cm/hr. Subsequently the column is washed with equilibration buffer until OD 280 nm decreases to
zero, and eluted at the same flow rate with 6 M urea in 10 mM Tris/HCl pH 9.

Alternatively, the dialysed and four times diluted antigen solution is supplemented with 20% (NH4)2SO4 (pH
7) and passed over a 150 ml column of Phenyl Sepharose FF, equilibrated in 10 mM Tris/HCl (pH 7.1), 20%
(NH4)2SO4, at a flow rate of 30 cm/hr. After washing the column with equilibration buffer, the column is eluted
at the same flow rate with 10 mM Tris/HCl pH 7.1.

The antigen positive fractions are pooled and finally centrifuged in one or two successive CsCl density
gradients at 4°C and 245,000 x g for 65 hrs. The purified hybrid CS-HBsAg particles have a buoyant density of
1.23 g/cm3.

Example 14. Immunogenicity of hybrid particles

Hybrid particles, purified from S. cerevisiae strain 10S44C containing pRIT12574, prepared by the method of
Example 13, were inoculated into laboratory animals, as such or with aluminum hydroxide as adjuvant (Young
et al.). Mice, guinea pigs, and rabbits were immunized subcutaneously and intraperitoneally
on days 0, 21 and 49. Animals were bled on days 7, 28, 56 and 84. Blood was allowed to clot at 4°C overnight
and sera were separated and stored at -70°C until used.

Rabbits, 2 groups of 2 animals each, received 10 µg of the hybrid CS-HBsAg protein in 1.0 ml doses, with or
without aluminum hydroxide, at each immunization. As a control, 2 rabbits were immunized with 10 µg
Heptavax B® (Merck Sharp & Dohme) following the same inoculation scheme. Mice and guinea pigs, in
groups of five animals, received 1 µg and 5 µg hybrid protein respectively, in 0.5 ml doses at each
immunization. Nonimmunized animals served as controls.

For determination of antibody responses, sera from mice were pooled for each group while guinea pig and
rabbit sera were assayed individually. Anti-CS antibody titers were measured by an ELISA assay using the
recombinant CS protein R32tet32 as antigen (see Young et al., Science, 228, 958-962, 1985). Anti-HBsAg
titers were determined by using the AUSAB kit (Abbott Laboratories) and the WHO reference.

Neither guinea pigs nor mice developed appreciable antibody responses to HBsAg despite repeated
boosting. In contrast, both animal species developed anti-CS antibodies and these responses were
augmented by boosting. An absorbance of 1.0 in the ELISA assay was obtained with serum dilutions of 1600
fold (mice) and 9000 fold (guinea pigs). Immunogenicity of the hybrid particles was not enhanced by
adsorption to aluminum hydroxide.

In contrast, rabbits developed antibodies to both CS and HBsAg. Reciprocal titers at an absorbance of 1.0 in
the anti-CS ELISA assay were between 800 and 3200. Anti-HBsAg titers obtained with the hybrid particles
adsorbed to aluminum hydroxide were between 4000 and 20,000 mIU/ml while the rabbits immunized with
Heptavax gave titers of 10 8 to 10« mIU/ml. Adsorption of the hybrid particles to aluminum hydroxide enhanced
the immune response.

Sera from mice, guinea pigs and rabbits all reacted strongly in the circumsporozoite precipitin reaction and
also inhibited sporozoite invasion of hepatoma cells in vitro (Young et al., Science, 228, 958-962, 1985) (see
Table IV). Both reactions are indicative of a protective immune response.

Table IV. Circumsporozoite precipitin (CSP) reactivity and percent inhibition of sporozoite
invasion (% ISI) of HepG2-A16 hepatoma cells

Animal(s),                 CSP Reactivity        % ISI
1 week post 2nd boost      (4+    2+    0)

Pooled mouse sera           14    9     2        91%
Rabbit #36                  22    3     0        96%
Guinea pig #1107            24    1     0        100%



Degree of CSP reactivity is shown in parentheses: 
0 - No CSP reactivity detectable 
2+ - granular precipitate on surface of sporozoite 
4+ - threadlike filament from end of sporozoite 



Example 15. Vaccine Preparation

An illustrative vaccine of the subject invention is prepared as follows: To a buffered, aqueous solution of 3%
aluminum hydroxide (10 mM sodium phosphate, 150 mM NaCl, pH 6.8; sterilized by filtration), the particle of
the invention in similar buffer is added with stirring to a final concentration of 100 µg/ml of polypeptide and 0.5
mg/ml of aluminum (Al3+). The pH is maintained at 6.8. The mixture is left overnight at about 0°C. Thimerosal is
added to a final concentration of OSX&Vo. The pH is checked and adjusted, if necessary, to 6.8.

While the above fully describes the invention and all preferred embodiments thereof, it is to be appreciated
that the invention is not limited to the embodiments particularly described but rather includes all modifications
thereof coming within the scope of the following claims.

EXAMPLES OF Pre S2-S PROTEIN AND Pre S1-Pre S2-S PROTEIN PRODUCTION BY YEAST
It has been discovered that the insertion of the Pre S2-S protein coding region of this invention in a yeast
expression vector into a yeast results in the synthesis of particles resembling the authentic 22 nm HBsAg
particles exposing glycosylated Pre S2 protein on their surface. It has not been previously correctly reported
that yeast, transformed with any portion of the HBV genome, express a glycosylated product.

The following examples relate to the expression in yeast of the glycosylated form of a protein containing the
Pre S2 region in addition to the S protein of Hepatitis B surface antigen. The protein can be extracted and
purified from yeast cells in the form of particles for use as a human vaccine. The Pre S1-Pre S2-S protein
expressed in yeast cells is also glycosylated and may also be extracted and purified therefrom in the form of
particles for use in a vaccine against Hepatitis B virus.
With respect to the finding of glycosylation, the following should be noted:

1. Examples 25, 26 and 27 concern concanavalin A binding, tunicamycln effect and sensitivity to tndo 
H, respectively. Each prove that the pre S2-S protein species of 33Kd Is glycosylated. Thus, glycosylate 
has been proven by three Independent methods. 

2. The specificity of these reagents are, In general terms: 

a) concanavalin A binds preferentially to mannose residues. 

b) tunicamycln inhibits ali types of glycosylate that are N-glycosidically bound to asparagine. 

c) endoglycosldase H cleaves asparagine-flnked high mannose oligosacchardles leaving one 
N-acetylglucosamlne residue attached to the asparagine. 

3. From the combined results of Examples 25, 26, 27 and the above, it is deduced that there is an
N-linked oligosaccharide chain present on the 33Kd form of the pre S2-S protein (tunicamycin inhibition,
sensitivity to Endo H) which is of the high-mannose type (sensitivity to Endo H). The shift in molecular
weight observed (about 3000) upon tunicamycin treatment and Endo H digestion indicates that the
oligosaccharide structure is composed of about 10 sugar residues. This is about the size of a core
glycosylation in yeast. Thus, probably only one of the three potential glycosylation sites in the Pre S2-S
protein is glycosylated.

4. The purification schemes described in Examples 28 and 29 both contain an affinity chromatography
step based on sugar residues of a glycoprotein.

5. Phenylboronate has a specificity for 1,2-cis-diol groups (2 adjacent OH groups sterically positioned in
the same direction).

6. Lentil Sepharose is glucose/mannose specific.

Example 16. Construction of pRIT12662 - an E. coli vector containing the PreS2-S expression cassette (Figure
16).

Plasmid pRIT12290, obtained as described in Example 3, above, was modified by the insertion of a
BamHI-EcoRI synthetic adaptor fragment encoding the N-terminal amino acids of the pre S2 region between
the TDH3 promoter and ARG3 terminator fragments. This was done as follows:

Plasmid pRIT12290 was digested with BamHI and EcoRI enzymes and dephosphorylated with alkaline
phosphatase. 200 pmoles of the phosphorylated synthetic adaptor



                Gln Trp
BamHI   5' GATC CAG TGG      3'
        3'      GTC ACC TTAA 5'   EcoRI

were ligated with 0.1 pmoles of the BamHI-EcoRI dephosphorylated plasmid pRIT12290 using 1 unit (U) of T4
ligase and used to transform the E. coli strain MM294 to ampicillin resistance. One transformant harboring the
desired plasmid with the synthetic adaptor inserted between the BamHI and EcoRI sites was identified and
retained. The plasmid is identified as pRIT12621. Starting with plasmid pRIT12621, the next steps were the
elimination of the four extra base pairs (the core of the BamHI site) in order to put the ATG codon in phase with
the second codon of the Pre S2 region, and the insertion of an EcoRI fragment containing the rest of the pre S2
coding region and the complete S coding region into the EcoRI site of pRIT12621 to complete the entire pre
S2-S coding region.
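
The base pairing and the cohesive ends of the synthetic BamHI-EcoRI adaptor given above can be checked
mechanically. The following is only a minimal sketch and is not part of the described procedure; the function
name `paired_region_ok` and the convention of passing the length of the top-strand overhang are assumptions
made for illustration.

```python
# Illustrative check of the BamHI-EcoRI adaptor strands quoted in Example 16.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def paired_region_ok(top_5to3: str, bottom_3to5: str, top_overhang: int) -> bool:
    """Return True if the bottom strand (written 3'->5') base-pairs with the
    top strand once the 5' overhang of the top strand is skipped."""
    paired_top = top_5to3[top_overhang:]
    paired_bottom = bottom_3to5[:len(paired_top)]
    return paired_top.translate(COMPLEMENT) == paired_bottom


top = "GATCCAGTGG"     # 5' GATC CAG TGG 3'
bottom = "GTCACCTTAA"  # 3' GTC ACC TTAA 5'

# Duplex region CAG TGG / GTC ACC
print(paired_region_ok(top, bottom, top_overhang=4))

# 5' overhangs read 5'->3': GATC matches a BamHI end, AATT matches an EcoRI end
print(top[:4], bottom[-4:][::-1])
```

The same check applies to any adaptor written in this top/bottom layout, provided the overhang length of the
top strand is supplied.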

At this stage the DNA manipulations, including subsequent linearization, deletion and recircularization of the
plasmid pRIT12621, were realized without amplification by transformation of intermediate plasmids; the
procedure was as follows:

30 pmoles (120 µg) of pRIT12621 DNA were linearized with 120 U of BamHI. After elimination of the
restriction enzyme by phenol extraction, the plasmid was ethanol precipitated and resuspended in the
appropriate buffer to digest the BamHI single strand extensions with 280 units (U) of mung bean nuclease. The
nuclease was eliminated by phenol extraction followed by ethanol precipitation. The so treated plasmid was
resuspended in the ligation buffer and recircularized with 10 units of T4 ligase.

The preparation containing the ligated plasmid was phenolized, ethanol precipitated and resuspended into
appropriate buffer and digested with EcoRI enzyme. After elimination of EcoRI endonuclease by phenolization
and ethanol precipitation, 0.05 pmoles (0.2 µg) of DNA were resuspended in the ligation buffer in order to be
ligated with 0.4 pmoles (0.2 µg) of a 838 base pair EcoRI fragment isolated from pRIT12581 that contains the
entire C-terminal coding region of the pre S2-S protein. A plasmid essentially similar to pRIT12581 can be
constructed as follows. The vector pUC9 (available commercially from Amersham and Pharmacia) is digested
with EcoRI endonuclease and treated with alkaline phosphatase (Shine, U.S. patent 4,264,731). Plasmid
pRIT10616 (ATCC 39131) is digested with EcoRI and AccI restriction endonucleases and the 826 bp EcoRI-AccI
fragment containing part of the pre S2-S coding region is recovered by preparative acrylamide gel
electrophoresis and electroelution. About 0.4 pmoles (0.2 µg) of this fragment is mixed with about 10 pmoles
of the following 12 bp synthetic adaptor previously annealed as



            Tyr Ile Stop
AccI   5' ATACATTTAACG     3'
       3'   TGTAAATTGCTTAA 5'   EcoRI

described in Example 8, above, and the mixture is treated with T4 DNA ligase and digested with EcoRI
restriction endonuclease. To the so treated mixture is added about 0.2 µg of the EcoRI digested alkaline
phosphatase treated pUC9 vector, prepared as described above, and the mixture is treated with T4 DNA ligase
and used to transform competent cells of E. coli K12 to ampicillin resistance by conventional procedures.
Colonies from the transformation are examined for the presence of a plasmid consisting of the 2650 bp pUC9
vector together with a 838 bp EcoRI fragment comprised of the 826 bp EcoRI-AccI pre S2-S fragment together
with the 12 bp AccI-EcoRI synthetic linker. The correctness of the construction so identified can be verified by
DNA sequencing using the chemical modification method of Maxam and Gilbert (Methods in Enzymology, 65,
499, 1980), preferably from a unique flanking restriction site in the pUC9 polylinker.
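
The molar and mass quantities quoted in this Example can be cross-checked against the fragment lengths. The
sketch below is illustrative only; it assumes an average mass of 650 g/mol per base pair of double-stranded
DNA, a common approximation that is not stated in the text, and the function names are hypothetical.

```python
# Cross-check of pmol <-> microgram figures for double-stranded DNA.
AVG_BP_MASS = 650.0  # g/mol per base pair; approximation assumed for this sketch


def pmol_to_ug(pmol: float, length_bp: int) -> float:
    """Mass in micrograms of `pmol` picomoles of dsDNA of the given length."""
    grams = pmol * 1e-12 * length_bp * AVG_BP_MASS
    return grams * 1e6


def implied_length_bp(ug: float, pmol: float) -> float:
    """Length in base pairs implied by a quoted mass and molar amount."""
    grams_per_mol = (ug * 1e-6) / (pmol * 1e-12)
    return grams_per_mol / AVG_BP_MASS


# 0.4 pmoles of the 838 bp EcoRI fragment, quoted as about 0.2 micrograms
print(f"{pmol_to_ug(0.4, 838):.2f} ug")

# 30 pmoles (120 micrograms) of pRIT12621 implies a plasmid of roughly this size
print(f"{implied_length_bp(120, 30):.0f} bp")
```

Under this assumption the quoted figures for the 838 bp fragment and for the linearized plasmid are mutually
consistent to within rounding.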

The ligation mixture containing plasmid DNA of pRIT12621, treated as described above, and the 838 bp
EcoRI fragment of pRIT12581 was used to transform the E. coli strain MM294 according to the method of
Cohen et al., cited above. One transformant having a plasmid with the EcoRI fragment in correct orientation
was retained and the plasmid is identified as pRIT12662 (See, Figure 1A and Figure 16). Figure 1A is a flow
sheet illustrating the preparation of pRIT12662. Figure 16 is a restriction endonuclease map of pRIT12662. The
pre S2 region from pRIT12662 was sequenced by the chemical modification procedure of Maxam and Gilbert
(Methods in Enzymology, 65, 499, 1980) to check that the coding reading frame at the N-terminus was as
follows:

              Met Gln Trp Asn Ser
AAAC AAAC AAA ATG CAG TGG AAT TCC
   pTDH3      pre S2-S
              coding region
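
The reading frame shown above can be confirmed by translating the quoted sequence from its first ATG. This is
a minimal sketch for illustration: the codon table is deliberately restricted to the five codons that occur
in the quoted sequence, and the function name is an assumption.

```python
# Translate the junction sequence quoted above, starting at the first ATG.
# Only the codons present in this sequence are listed (standard genetic code).
CODON_TABLE = {
    "ATG": "Met",
    "CAG": "Gln",
    "TGG": "Trp",
    "AAT": "Asn",
    "TCC": "Ser",
}


def translate_from_first_atg(dna: str) -> list[str]:
    """Return the amino acids encoded from the first ATG to the end of `dna`."""
    seq = dna.replace(" ", "").upper()
    start = seq.find("ATG")
    if start < 0:
        raise ValueError("no ATG start codon found")
    codons = [seq[i:i + 3] for i in range(start, len(seq) - 2, 3)]
    return [CODON_TABLE[codon] for codon in codons]


junction = "AAAC AAAC AAA ATG CAG TGG AAT TCC"
print(" ".join(translate_from_first_atg(junction)))
```

The AAT TCC codons span the EcoRI site (GAATTC) at which the remainder of the pre S2-S coding region was
inserted.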



It will be appreciated by one skilled in the art that the vector pRIT12662 may also be used for the introduction
of further functional DNA sequences into the preS2 region by appropriate in vitro manipulation of vector DNA.
The preS2 coding sequence contains restriction sites for EcoRI, PstI and BamHI endonucleases. Any of these
sites may be used for introduction of functional DNA sequences encoding peptides of interest to create in
phase fusions with the preS2 region. Such fusions will be of the type preS2 - introduced sequence - preS2-S.
These restriction sites may also be manipulated in vitro to create other vectors and provide other sites or
reading frames for insertion of functional DNA sequences, such as by the method of Botstein et al. (Science,
229, 1193-1201, 1985).

Example 17. Construction of pRIT12377 and of pRIT12660, a yeast plasmid expressing the preS2-S protein

DNA of plasmid pRIT12309, obtained as described in Example 10(B), was digested with SalI endonuclease
and ligated with a 2218 bp XhoI-SalI fragment of DNA containing the yeast LEU2 gene obtained by digestion of
YEp13 and purification of the fragment by acrylamide gel electrophoresis. The ligated mixture was used to
transform cells of E. coli K12 strain JA221, prepared according to the procedure of Cohen et al., cited above,
with selection being made for both ampicillin resistance and leucine independence. Strain JA221 was obtained
from M. Rosenberg, Smith Kline and French Laboratories, Philadelphia, Pennsylvania, U.S.A., and is available
from the American Type Culture Collection under accession number 33875. A plasmid, pRIT12377, was
identified from one transformant colony which contains the 2218 bp LEU2 XhoI-SalI fragment inserted at the
SalI site of pRIT12309.

The orientation of the fragment on the vector is such as to recreate the SalI site on the AvaI side of the
pBR327 part of the vector. Plasmid pRIT12377 is an E. coli-yeast shuttle vector having the necessary
functions for selection, replication and maintenance in both organisms.

The 3050 base pair HindIII fragment from pRIT12662 (Example 16), containing the pre S2-S expression
cassette, was purified by acrylamide gel electrophoresis, treated with T4 polymerase and ligated to pRIT12377
previously opened by BamHI enzyme and T4 polymerase repaired. This preparation was used to transform
strain E. coli MM294 to ampicillin resistance according to the method of Cohen et al., cited above. An E. coli
transformant harboring the plasmid pRIT12660 was retained. Figure 17 is a flow sheet illustrating preparation
of pRIT12660.

This plasmid, pRIT12660, was used to transform two strains of S. cerevisiae, i.e., strain DC5 cir° and strain
10S44C cir°, according to the method described by Hinnen et al. (Proc. Natl. Acad. Sci., USA, 79, 2157-2161,
1978).

Both strain DC5 cir° and 10S44C cir° were deposited in the American Type Culture Collection, Rockville,
Maryland, U.S.A., in accordance with the Budapest Treaty on August 18, 1986 under accession numbers ATCC
20820 and ATCC 20818, respectively.

Example 18. The preS2 region in plasmid pRIT12662.

The preS2 region in plasmid pRIT12662 was sequenced using the chemical modification method of Maxam
and Gilbert (Proc. Natl. Acad. Sci., 74, 560-564, 1977) and it was found that there were four base pair changes
leading to three amino acid changes when compared to another virus of adw2 serotype (Valenzuela et al., 1980
- Table I page 11): an alanine residue instead of a threonine residue at position -45, a phenylalanine instead of a
leucine at position -34, and a serine residue instead of an isoleucine residue at position -11. The last change
affects a residue that had been found invariable in all known serotypes until now (Lo et al., Biochem. Biophys.
Res. Com., 2, 382-388, 1986).

Example 19. Synthesis of particles carrying a receptor for polymerized human serum albumin (pHSA)

Yeast (S. cerevisiae) strains DC5 cir° or 10S44C cir° containing either pRIT12660 or pRIT12363 (Example
16) were grown in selective medium [200 ml YNB + 80 µg/ml histidine (DC5 cir°) or YNB (10S44C cir°)] into
log phase. (pRIT12363 is identical to pRIT12660 except for the hepatitis derived gene; pRIT12363 contains the
S gene only). Cells were collected by centrifugation and resuspended in 50 mM sodium phosphate pH 8.0
containing 0.[illegible]% Tween 20 (polyethylene sorbitan monolaurate), 1 mM PMSF (phenylmethylsulfonyl
fluoride) and 2.5% isopropanol. Cells were disrupted by passaging twice through a French Press at 20,000 psi
(1.38 x 10^8 Pa). The suspension was centrifuged for 30 min. at 30,000 X g. Total protein concentration in the
supernatant liquid (crude cell extract) was measured by the method of Lowry et al. (J. Biol. Chem., 193,
265-275, 1951), with bovine serum albumin as standard. The presence of HBsAg related material was assayed
using a radioimmune assay kit (AUSRIA II) commercially available from Abbott Laboratories. The results (Table
V) indicate that pRIT12660 directs the synthesis of material antigenically related to HBsAg particles. The
expression level, based on AUSRIA activity, is about the same in yeast strains DC5 cir° and 10S44C cir°.
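
The pressure quoted for the French Press step is given in both psi and pascals, and the conversion can be
verified directly. This is a small illustrative sketch; the constant is the standard conversion factor and the
function name is an assumption.

```python
# Verify the psi -> Pa conversion quoted for the French Press step.
PA_PER_PSI = 6894.757  # pascals per pound-force per square inch


def psi_to_pa(psi: float) -> float:
    """Convert a pressure in psi to pascals."""
    return psi * PA_PER_PSI


pressure_pa = psi_to_pa(20_000)
print(f"{pressure_pa:.3e} Pa")
```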

In a radioimmune assay to detect a receptor for polymerized human serum albumin (pHSA) (Pontisso et al.,
J. Virol. Methods, 6, 151-159, 1983), crude cell extracts from pRIT12660 containing cells gave a positive
reaction while crude cell extracts from pRIT12363 containing cells did not. No reaction above background was
observed with polymerized bovine serum albumin.

CsCl equilibrium centrifugation of crude cell extracts from pRIT12660 containing cells showed the AUSRIA
positive material to band around rho = 1.20 g/cm³ as do serum derived or yeast derived HBsAg particles.
Material recovered from this portion of the gradient gives a positive result in the pHSA test.

The aforementioned results show that in pRIT12660 containing yeast cells a HBsAg related protein is
expressed which is assembled into a structure like that of 22 nm particles from serum and exposing a receptor
for pHSA on its surface.



Table V. AUSRIA activity in crude cell extracts

Strain         Plasmid      Protein    HBsAg related    HBsAg related
                            mg/ml      antigenicity     antigenicity
                                       µg/ml            ng/mg protein

DC5 cir°       pRIT12660    13.4       7                522
DC5 cir°       pRIT12660    12.6       7                555
10S44C cir°    pRIT12660    10.8       6                556
10S44C cir°    pRIT12660     9.5       6                631
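
The last column of the table is the antigen concentration divided by the total protein concentration. The
sketch below recomputes it from the first two numeric columns. It assumes that the antigen concentration is
expressed in µg/ml, which is the reading under which the two antigenicity columns agree; the tabulated
antigen values are rounded, so only approximate agreement is expected. Names are illustrative.

```python
# Recompute antigen per mg protein from the values in the AUSRIA table.
# Assumption: antigen concentration column is in micrograms per ml.
ROWS = [
    # (strain, protein mg/ml, antigen ug/ml, tabulated ng/mg protein)
    ("DC5 cir", 13.4, 7, 522),
    ("DC5 cir", 12.6, 7, 555),
    ("10S44C cir", 10.8, 6, 556),
    ("10S44C cir", 9.5, 6, 631),
]


def antigen_ng_per_mg(antigen_ug_per_ml: float, protein_mg_per_ml: float) -> float:
    """Antigen content in ng per mg of total protein."""
    return antigen_ug_per_ml / protein_mg_per_ml * 1000.0


for strain, protein, antigen, tabulated in ROWS:
    computed = antigen_ng_per_mg(antigen, protein)
    print(f"{strain:<12} computed {computed:6.1f}  tabulated {tabulated}")
```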



Example 20. Four S-related protein species are found to be expressed in yeast transformed with pRIT12660.

To analyze the HBsAg related protein monomers expressed in pRIT12660 containing cells, cells were grown
in selective medium, either in Erlenmeyer flasks or in fermenters, into log phase and collected by
centrifugation.

To the packed cells (0.1 gram), first 20 µl 40 mM PMSF in isopropanol, and then 200 µl sample buffer for
SDS-polyacrylamide gel electrophoresis (0.125 M tris-HCl, pH 6.8 containing 20% glycerol, 4% SDS, 6 M urea
and 10% 2-mercaptoethanol), were added. Samples were vortexed and aliquots of 2 to 20 µl were further
diluted in sample buffer to a final volume of 40 µl, incubated for 5 minutes at 100° and electrophoresed through
a 12.5% separating, [illegible]% stacking gel according to the method of Laemmli (Nature, 227, 680, 1970).

After electroblotting (Towbin et al., Proc. Natl. Acad. Sci., U.S.A., 76, 4350-4354, 1979) of the proteins onto a
nitrocellulose filter (0.45 µm, BioRad), the filter was preincubated for one hour at 37°C with 3% gelatin in PBS.
The filter was incubated with a monoclonal antibody directed against a denaturation and reduction resistant
epitope of human derived HBsAg (monoclonal 6 obtained from H. Thomas, Royal Free Hospital, London,
England) and antibody binding was detected by successive incubation with biotinylated sheep anti-mouse Ig
(RPN 1021, Amersham, dilution 1/250 in PBS containing 1% gelatin), streptavidin-biotinylated horseradish
peroxidase complex (RPN 1051, Amersham, dilution 1/400 in PBS containing 1% gelatin) and finally 30 µl H2O2
and HRP Color Development reagent containing 4-chloro-1-naphthol (BioRad), 30 mg in 10 ml methanol, in 50
ml PBS. Between incubations the filter was washed with PBS containing 0.1% Tween 20.

The immunoblot showed 4 protein bands related to the S protein (p23) of estimated molecular weights of
about 33Kd, 30Kd, 28Kd and 25Kd which are detected by the monoclonal antibody. The majority of S related
material is found in the 33K protein band and was found to migrate slightly faster than the gp33 protein
described by Stibbe and Gerlich (Virology, 123, 436-442, 1982). The smallest protein band detected migrates
slower than the S protein (p23) synthesized in yeast.

Example 21. The four S-related protein species (of 33Kd, 30Kd, 28Kd and 25Kd) are assembled in particles.

Crude cell extracts of pRIT12660 (Example 16) containing yeast cells were prepared as described in
Example 19 or by grinding yeast cells in a Dynomill in the presence of an equal weight of extraction buffer (200
mM tris, 3 M NaCl, 10 mM EDTA, 10 mM ethylene glycol bis (beta-aminoethyl ether) N,N,N',N'-tetraacetic acid
(EGTA), 4 mM PMSF, 10% isopropanol) and subsequent centrifugation for 45 minutes at 30,000 X g.

The pRIT12660 directed particles were partially purified from the crude cell extract by ultrafiltration, CsCl
equilibrium centrifugation and hydroxyapatite chromatography, which are methods known in the art. The four
protein bands of S-related material detected by Western blot analysis copurify during these purification steps.
An immunoblot analysis (as described in Example 20) of the AUSRIA positive CsCl fractions after equilibrium
centrifugation showed that the four S-related bands were present in the same relative proportions in each
fraction.

The material is thus distributed in homogeneous fashion throughout the peak fractions of the gradient.

Example 22. The 33Kd and 30Kd proteins carry a receptor for polymerised human serum albumin.

Human serum albumin (Sigma, A 8763) was polymerised by glutaraldehyde treatment (Merck, A-12179) and
purified according to Pontisso et al. (J. Virol. Methods, 6, 151-159, 1983). Reactivity of the four S-related
proteins towards pHSA was tested in a Western blot procedure. Partially purified preparations of pRIT12660
directed particles (Example 21) were separated by SDS polyacrylamide gel electrophoresis and transferred to
a nitrocellulose membrane as described in Example 20. The nitrocellulose filter was successively incubated
with pHSA (15 µg/ml in PBS containing 1% gelatin), rabbit anti-serum to human albumin (Behringwerke,
ORCB 04/05, 1/125 dilution in PBS containing 1% gelatin), biotinylated donkey anti-rabbit Ig (Amersham RPN
1004, 1/250 dilution in PBS containing 1% gelatin), streptavidin-biotinylated horseradish peroxidase
(Amersham RPN 1051, 1/400 dilution in PBS containing 1% gelatin) and finally with H2O2 and
4-chloro-1-naphthol as described in Example 20.

The two largest S-related protein bands (the 33Kd and 30Kd) reacted with pHSA, while the two smallest ones
(28Kd and 25Kd) gave no reaction.

Example 23. The 33Kd and 30Kd proteins contain a sequence specific for the preS2 region of the preS2-S
protein

A peptide corresponding in 23 of 26 positions to the 28 N-terminal amino acids of the preS2-S protein was
synthesized (NH2-Met-Glu-Trp-Asn-Ser-...-Leu-Pro-Ala-Gly-Cys-COOH) and coupled via the C-terminal to
keyhole limpet hemocyanin (Peninsula Laboratories). Rabbits were immunized subcutaneously with 100 µg of
the conjugate (which corresponds to 20 µg peptide) in Freund's complete adjuvant, boosted at days 8 and 15
with another 100 µg conjugate in Freund's complete adjuvant and at days 28, 69 and 149 with 100 µg conjugate
in PBS. Development of antibody titer was followed by taking blood samples at regular intervals and testing
serum dilutions by incubation on solid-phase fixed synthetic peptide. Bound antibodies were detected by
iodinated protein A (New England Nuclear). For immunoblot applications, sera with antipeptide titers greater
than or equal to 1/512 were selected. The serum was used as such after a 100 fold dilution or after partial
purification of IgG. IgG was partially purified by precipitation with Na2SO4 (120 mg per ml of serum),
resuspension in PBS and reprecipitation with 14% Na2SO4. The resuspended IgG solution was desalted by
passage on a PD10 column (Pharmacia).

Detection of reactivity of these antibodies towards the S-related proteins in immunoblots was by
biotinylated donkey anti-rabbit Ig, streptavidin-biotinylated horseradish peroxidase and H2O2 and
4-chloro-1-naphthol as described in Example 22.

Proteins from partially purified preparations of pRIT12660 directed particles (Example 21) were separated by
SDS polyacrylamide gel electrophoresis and transferred to nitrocellulose as described in Example 20. Also run
on the gel was S protein (p23) from particles purified from yeast harboring pRIT12363. The immunoblot
showed that the two largest S-related protein bands (the 33Kd and 30Kd) react with the anti-peptide
antibodies while the two smallest ones (28Kd and 25Kd) give no reaction. The S protein (p23) gives no reaction
with the anti-peptide antibodies.

Example 24. Human anti-Hepatitis B antibodies from naturally infected persons recognize epitope(s) on the
33Kd and 30Kd preS2-S proteins not present on the S protein.

Reactivity of pooled gamma globulin from anti-hepatitis B positive human subjects (Belgian Red Cross,
100-150 IU/ml, Lot 85A08) towards the four S-related proteins was tested by the immunoblot procedure
described in Example 20. A partially purified preparation (Example 21) of pRIT12660 directed particles was
dissociated by SDS and the proteins separated by SDS polyacrylamide gel electrophoresis. After transfer of
the proteins to nitrocellulose, the membrane was incubated with a ten-fold dilution of anti-Hepatitis B gamma
globulin in PBS containing 1% gelatin. Antibody binding was detected by successive incubation with
biotinylated sheep anti-human Ig (Amersham RPN 1003, 1/250 dilution in PBS containing 1% gelatin),
streptavidin-biotinylated horseradish peroxidase and finally with H2O2 and 4-chloro-1-naphthol as described in
Example 20.

The two largest S-related protein bands (the 33Kd and 30Kd band) react with the human antibodies, while the
two smallest ones (the 28Kd and the 25Kd band) give no reaction. The S protein (p23) derived from particles
from yeast cells containing pRIT12363 does not react with these human antibodies. These results show that
gamma globulin obtained from infected persons specifically recognizes epitopes present on denatured and
reduced preS2-S proteins which are absent on denatured and reduced S protein. It is known that the epitopes
present on S protein derived from human serum and yeast are predominantly conformational.

Example 25. The 33Kd protein species carries a mannosyl containing oligosaccharide structure.

PreS2-S protein from partially purified preparations of particles from pRIT12660 containing 10S44C cir° cells
was electrophoretically separated and transferred to nitrocellulose as described in Example 20. Also run on
the gel were S protein from particles purified from yeast containing pRIT12363 and HBsAg particles from
infected human serum obtained from Dr. W.H. Gerlich, Department of Medical Microbiology, University of
Göttingen, Federal Republic of Germany.

One half of the nitrocellulose filter was incubated in 50 µg/ml concanavalin A (Calbiochem A grade) in 10 mM
tris pH 7.5, 150 mM NaCl, 1 mM CaCl2, 1 mM MnCl2, 0.05% Tween 20 and subsequently with 30 µg/ml
horseradish peroxidase (Sigma type VI, P 8375) in 10 mM tris pH 7.5, 150 mM NaCl, 0.05% Tween 20 and finally
H2O2 and 4-chloro-1-naphthol as described in Example 20.

Between incubations the filter was washed with 10 mM tris pH 7.5, 150 mM NaCl, 0.05% Tween 20 (Hawkes,
Anal. Biochem., 123, 143-146, 1982; Clegg, Anal. Biochem., 127, 389-394, 1982). The other half of the
nitrocellulose filter was incubated with monoclonal antibody 6 and treated as described in Example 20.

The 33K protein band reacts with the concanavalin A-peroxidase reagents while none of the 30Kd, 28Kd,
25Kd protein bands or the S protein derived from pRIT12363 containing cells reacts. In the presence of 0.1 M
methyl alpha-D-mannopyranoside the binding of concanavalin A to the 33K protein is abolished.

Example 26. An oligosaccharide structure is N-glycosidically bound to asparagine

pRIT12660 and pRIT12377 (Example 17) containing cells were grown in selective medium [YNB + 80 µg/ml
histidine (DC5 cir°) or YNB (10S44C cir°)] to OD620nm = 0.2. Tunicamycin (Calbiochem) was added to a final
concentration of 10 µg/ml and the cultures incubated for 30 min at 30°C. Then 20 µCi 35S-methionine (1000
Ci/mmol) (New England Nuclear) per ml was added and the cultures were incubated another 15 minutes. 2 ml
of labeled cell culture, either treated or untreated with tunicamycin, were centrifuged in a microfuge and the
packed cells were resuspended in 40 µl 1% SDS and disrupted by agitation with a Vortex mixer for 2 min. in the
presence of 0.3 gram glass beads (diameter 0.45-0.50 mm).

Immune precipitation was essentially according to Julius et al. (Cell, 36, 309-318, 1984). The lysate was
heated for 3 minutes (min.) at 100°, diluted with 0.5 ml PBS containing [illegible] Triton X100, [illegible]
sodium deoxycholate, 0.1% SDS and heated again for 1 min. at 100°.

25 µl of a 10% suspension of prewashed Immuno-precipitin (Formalin-fixed Staph A cells, Bethesda
Research Laboratories) was added to the boiled extract and incubated at 0° for 30 min. The mixture was
clarified by centrifugation for 5 min. in a microfuge. To the supernatant liquid, 2.5 µl monoclonal antibody 6
specific for S protein (see Example 20) was added, and the mixture was incubated for 1 hour at 0°. Another
portion (25 µl) of Immuno-precipitin was then added and incubated for 30 min. at 0°. The immune complexes
were collected by centrifugation (5 min. in a microfuge) and washed with PBS containing 2 M urea, PBS
containing 1% 2-mercaptoethanol and finally PBS. The final washed pellets were resuspended in sample
buffer for electrophoresis as in Example 20.

Samples containing 10^6 cpm were electrophoresed through a 12.5% SDS polyacrylamide gel (see Example
20). After electrophoresis, gels were fixed in 40% methanol, 10% acetic acid, treated with Amplify (Amersham)
for fluorography, dried on filter paper under vacuum and subjected to fluorography using Kodak X-Omat S film
at -70°.

This showed that pRIT12660 containing cells synthesize a 33Kd protein in the absence of tunicamycin and a
30Kd protein in the presence of tunicamycin. Both bands are absent in the corresponding samples of
pRIT12377 containing cells.

This result shows that the 33Kd preS2-S protein expressed in yeast strain DC5 cir° carries an
oligosaccharide portion N-glycosidically bound to asparagine. There is most probably only one
oligosaccharide chain attached to the 33Kd form of the preS2-S protein, which is not much larger than a core
glycosylation.

Example 27. The oligosaccharide structure is of the high-mannose type.

Crude cell extracts from pRIT12660 containing cells or partially purified preparations of preS2-S containing
particles from these extracts (Example 21) were treated with Endo-N-acetylglucosaminidase (Endo H;
EC 3.2.1.96) from Streptomyces plicatus obtained from New England Nuclear.

Partially purified particles (50 µl, 450 µg/ml protein) were boiled for 3 minutes at 100° in the presence of
0.1% sodium dodecyl sulphate, diluted ten-fold with 100 mM sodium citrate pH 5 and to 240 µl of this mixture
2.4 µl Endo H (74 µg/ml) was added. Samples with and without added Endo H were incubated for 4 hrs at 37°.
Analysis of the Endo H treated and untreated samples by SDS-polyacrylamide gel electrophoresis and
immunoblotting with detection by the monoclonal antibody 6 directed against the S protein as described in
Example 20 showed the disappearance of the 33Kd band and the appearance of a 31Kd band upon Endo H
treatment. The 31Kd band still reacts with the concanavalin A-peroxidase reagents as described above. The
position of the other bands (30Kd, 28Kd and 25Kd) did not shift upon Endo H treatment. Longer incubation
times or higher Endo H concentrations gave identical results. The result shows that the oligosaccharide
structure linked to the 33Kd protein is of the high-mannose type and has about the size of a core glycosylation.

Example 28. Purification of preS2-S protein containing particles.

Crude yeast extract containing preS2-S particles is prepared by grinding the washed yeast cells in a
Dynomill in the presence of an equal weight of extraction buffer, either 50 mM sodium phosphate pH 8.1, 4 mM
EDTA, 1.[illegible]% Tween 20, 4 mM PMSF and 10% isopropanol or 200 mM tris-HCl pH 9.0, 3.0 M NaCl, 10 mM
EDTA, 10 mM EGTA (ethylene glycol bis (beta-aminoethyl ether) N,N,N',N'-tetraacetic acid), 1% Tween 20, 4 mM
PMSF and 10% isopropanol.

The homogenate is adjusted to pH 8.0 or 9.0 respectively and subsequently centrifuged for 45 minutes at
30,000 X g.

To 500 ml of the crude yeast extract, adjusted to pH 7.2, is added 10 g colloidal silica (Aerosil 380, Degussa).
The colloidal suspension is stirred overnight at 4° and subsequently centrifuged at low speed. The Aerosil
pellet is washed three times with 150 ml 0.15 M NaCl for 1/4 hour and then eluted batchwise with 250 ml 10 mM
pyrophosphate buffer (pH 9.3) containing 2% Tween-20 at 37° for 3 hours. The desorbate, a yellowish, lightly
opalescent solution, is passed over a 50 ml column of phenyl boronate Agarose (Amicon) equilibrated in 10 mM
pyrophosphate buffer pH 9.0. The flow rate is 15 ml/hour.
The column is further washed with 2 column volumes of equilibration buffer, and is then eluted in the
reversed direction with 100 mM sorbitol in 50 mM Tris (pH 7.2) at a flow rate of 15 ml/hour. Eluted peak
fractions are pooled and, after adjustment to pH 8.1, passed over a DEAE-Fractogel TSK column equilibrated in
50 mM Tris pH 8.1. The particle is eluted in the same buffer containing 75 mM NaCl. After this step residual
nucleic acids were removed (<1 ng/20 µg protein). The balance sheet is shown in Table VI.

Table VI. Two step purification of preS2-S containing particles

                               mg S-related
Fraction               ml      protein (RIA)    mg protein    mg lipids    mg polysaccharide

Crude extract          500     11.3             14,600        5,286        7.850
Desorbate Aerosil      270     5.96             385           554          249
Flow through
PBA-column             315     2.83*            365           360          267
Eluate PBA-column      70      2.8              2.9           2.5

*not sufficiently glycosylated to be retained on the
column.
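
The balance sheet can be summarized as a recovery and a purification factor. The sketch below is illustrative
only. It assumes that the crude extract contains 11.3 mg S-related protein in 14,600 mg total protein and that
the phenylboronate (PBA) column eluate contains 2.8 mg S-related protein in 2.9 mg total protein; the class
and function names are hypothetical.

```python
# Recovery and fold purification from the two-step purification balance sheet.
from dataclasses import dataclass


@dataclass(frozen=True)
class PurificationStep:
    name: str
    s_protein_mg: float      # S-related protein measured by RIA
    total_protein_mg: float  # total protein in the fraction


def recovery_percent(step: PurificationStep, start: PurificationStep) -> float:
    """Percentage of the starting S-related protein still present in `step`."""
    return 100.0 * step.s_protein_mg / start.s_protein_mg


def purity_percent(step: PurificationStep) -> float:
    """S-related protein as a percentage of total protein."""
    return 100.0 * step.s_protein_mg / step.total_protein_mg


def fold_purification(step: PurificationStep, start: PurificationStep) -> float:
    """Ratio of purity in `step` to purity in the starting material."""
    return purity_percent(step) / purity_percent(start)


crude = PurificationStep("Crude extract", 11.3, 14_600.0)
eluate = PurificationStep("Eluate PBA-column", 2.8, 2.9)

print(f"recovery: {recovery_percent(eluate, crude):.1f} %")
print(f"purity:   {purity_percent(crude):.3f} % -> {purity_percent(eluate):.1f} %")
print(f"fold purification: {fold_purification(eluate, crude):.0f}")
```

On this reading of the columns, roughly a quarter of the S-related protein is recovered while its share of
total protein rises by about three orders of magnitude.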



Material purified from extracts of cells containing pRIT12660 as described in this Example 28, followed by a
CsCl equilibrium centrifugation, was prepared for electron microscope examination after staining with uranyl
acetate. Examination showed the presence of discrete particle structures about 19 nm in diameter, and
that such hybrid particles contained between 5 and 20 ng Tween 20 per 100 µg protein.

Example 29. Affinity chromatography on lentil sepharose.

In this example, the phenylboronate agarose chromatography step is replaced by affinity chromatography
on lentil sepharose (Pharmacia).

50 ml of Aerosil-desorbate, obtained as described in Example 28, was passed over a 5 ml column of lentil
Agarose equilibrated in equilibration buffer such as Tris buffer 10 mM (pH 7.2), 0.14 M NaCl and
0.[illegible]% Tween.

The column was washed with 2 column volumes of equilibration buffer such as Tris buffer 10 mM (pH 7.2) and
then eluted with a sugar additive with specificity for lentil lectin such as 0.5 M alpha-methylmannopyranoside in
equilibration buffer. The balance sheet is shown in Table VII.

Table VII. Application of lentil Sepharose affinity chromatography in the purification of
preS2-S containing particles.

                       Volume    µg S-related      Total
Fraction               (ml)      protein (RIA)     protein

Crude                  100       2170              3 g
Desorbate Aerosil      50        1700              138 mg
Flow through
Lentil-sepharose       50        428
Eluate
Lentil-sepharose       12.5      925               1.26 mg

Examination revealed that the hybrid particles purified as described herein contained between 5 and 50 ng
Tween 20 per 100 µg protein.

Example 30. Immunogenicity of the preS2-S particle.

The immunogenicity of particles prepared from extracts of cells containing pRIT12660 (Example 16) was
studied in 2 strains of mice: Balb/c (H2-d) and NMRI (H2-Q). The particles were adsorbed on Al(OH)3 before
injection and the mice (5 mice/group) were injected intraperitoneally (I.P.) with 0.3 µg of RIA reacting material.
One month after the immunization, the mice were bled and the sera from the mice of the same group were
pooled. The titer of anti-S antibodies was determined in the pools by using the AUSAB kit (Abbott Labs) and
the WHO reference. The titers were converted to mIU/ml by using the Hollinger formula (Hollinger et al., in
Szmuness et al., eds., Viral Hepatitis, Philadelphia: Franklin Institute Press, 461-466, 1982). The anti-preS2
antibodies were determined using a solid phase assay (RIA) in which the synthetic preS2 peptide given in
Example 23 is adsorbed on plastic and bound antibodies were detected with a goat anti-mouse IgG labelled
with 125I. The results summarized in Table VIII show that the particles from pRIT12660 containing cells induce
both anti-S and anti-preS2 peptide antibodies in both strains of mice, with higher titers of anti-preS2 peptide
antibodies induced in NMRI than in Balb/c mice. The anti-S titer obtained in Balb/c or NMRI mice is 2 to 3 times
higher than the titer obtained with a higher dose of particles from pRIT12363 containing cells, which do not
contain the preS2 sequence. PreS determinants enhance the response to the S determinants. This enhancing
effect is not shown if the preS2 synthetic peptide is injected together with particles which lack the preS2.
Moreover, the preS2 synthetic peptide alone is unable to elicit anti-preS2 antibodies.

This indicates that the association of preS2 determinants and S determinants on the same particle is
required to obtain an anti-preS2 response. Furthermore, this association results in an improved anti-S
response.
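
The fold increase in anti-S titer discussed above can be recomputed from the figures listed in Table VIII. The Python lines below are only an illustrative sketch: the dictionary layout and the variable names are assumptions introduced here and are not part of the original disclosure.

```python
# Anti-S titers (mIU/ml) and injected doses as listed in Table VIII.
# The data structure and names are illustrative only.
anti_s_titer = {
    "pRIT12363": {"dose_ug": 1.7, "Balb/c": 1404, "NMRI": 719},   # S particle
    "pRIT12660": {"dose_ug": 0.3, "Balb/c": 3344, "NMRI": 2418},  # preS2-S particle
}

# Ratio of the preS2-S particle titer to the S particle titer, per mouse strain.
fold_increase = {
    strain: anti_s_titer["pRIT12660"][strain] / anti_s_titer["pRIT12363"][strain]
    for strain in ("Balb/c", "NMRI")
}

for strain, ratio in fold_increase.items():
    print(f"{strain}: {ratio:.1f}-fold higher anti-S titer")
```

The ratios come out at roughly 2.4 (Balb/c) and 3.4 (NMRI), obtained with a dose of 0.3 µg against 1.7 µg for the particle lacking preS2.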

The results of the immunogenicity studies referred to herein related to the preS2-S particle are summarized
in Table VIII.

Table VIII. Immunogenicity of the preS2-S particle

                                                         Titre of
Preparation              Dose of S       Anti-S antibodies        Anti-preS2
injected                 related         (mIU/ml)(2)              antibodies (3)
                         antigen
                         (µg)(1)         Balb/c      NMRI         Balb/c      NMRI

Yeast-derived            1.7             1404        719          0           %
S particle
directed by
pRIT12363

Yeast-derived            1.55            748         1163         0           4
S particle
directed by
pRIT12363
+ synthetic
preS2 peptide:
0.08 µg

Synthetic                -               2           10           4
preS2 peptide:
0.08 µg

Yeast-derived            0.3             3344        2418         8           120
(preS2-S)
particle
directed by
pRIT12660

(1) determined by using the AUSRIA kit (Abbott Labs) and the Proposed International Reference
Preparation supplied by Dr. Schild, National Institute of Biological Standardization, London, as standard.

(2) determined by using the AUSAB kit (Abbott Labs) and the World Health Organization standard.

(3) determined by using a solid phase radioimmunoassay and synthetic preS2 peptide adsorbed to
plastic.

The titer is the dilution which gives a significant binding of the antibody (2.5 times the value obtained with a
negative control serum).
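
The titer definition above (the dilution giving at least 2.5 times the negative control signal) can be sketched as a small function. This is a minimal illustration only: the function name, the choice of the highest positive dilution as the endpoint, and the counts used in the example are all hypothetical and are not taken from the original data.

```python
def endpoint_titer(dilutions, signals, negative_control, factor=2.5):
    """Return the highest reciprocal dilution whose signal reaches the cutoff.

    dilutions:        reciprocal serum dilutions (10 means 1:10)
    signals:          bound radioactivity measured at each dilution
    negative_control: signal obtained with a negative control serum
    Returns 0 when no dilution reaches factor x negative control.
    """
    cutoff = factor * negative_control
    positive = [d for d, s in zip(dilutions, signals) if s >= cutoff]
    return max(positive) if positive else 0

# Hypothetical counts per minute, invented purely to exercise the function.
example_dilutions = [10, 20, 40, 80, 160, 320]
example_cpm = [5200, 3900, 2100, 1300, 700, 450]
example_negative_cpm = 400

titer = endpoint_titer(example_dilutions, example_cpm, example_negative_cpm)
print(titer)
```

With the invented counts shown, the cutoff is 1000 cpm and the last dilution above it is 1:80.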

EXAMPLES OF THE EXPRESSION OF THE ENTIRE PreS1-PreS2-S PROTEIN CODING SEQUENCE IN YEAST

Example 31. Construction of plasmid pRIT12845 that expresses the preS1-preS2-S protein in yeast

The first step was the fusion of the TDH3 promoter region with the pre S1 N-terminal region from the HBV
genome. The starting material was plasmid pRIT12792 that harbors a 495 bp NcoI-XbaI fragment containing
the pre S1-pre S2 coding sequence except for the last asparagine codon. The NcoI site overlaps the ATG
codon of amino acid residue -163 of the HBV pre S1 sequence. The sequence of the NcoI-XbaI fragment is
shown in Figure 4A.

An essentially similar plasmid to pRIT12792, identified as pRIT12793 (5940 bp), was deposited in the
American Type Culture Collection, Rockville, Maryland, U.S.A., in accordance with the regulations of the
Budapest Treaty under accession number ATCC 67193 on September 5, 1986. From plasmid pRIT12793, a 495
bp NcoI-XbaI fragment can be recovered which contains a complete pre S1-pre S2 coding sequence. This
fragment differs from that shown in Figure 4A by one nucleotide, having the sequence ACGAACTAATAATCTAGA
at the 3' end of the fragment. The nucleotide difference is underscored. In the following examples,
pRIT12792 can be replaced with pRIT12793. The NcoI-XbaI fragments contained in pRIT12792 and
pRIT12793 were both derived by in vitro manipulation of DNA from the PreS1-PreS2 region of the DNA of Hepatitis B
virus strain (serotype adw) cloned on plasmid pRIT10616. Plasmid pRIT10616 is available from the American
Type Culture Collection under accession number ATCC 39131.

52 µg of pRIT12792 were digested with the NcoI and XbaI restriction enzymes and treated with mung bean
nuclease in order to eliminate the 5' extensions. 600 ng (2 pmol) of the treated and purified 495 bp NcoI-XbaI
fragment were ligated to 170 ng (0.02 pmol) of pRIT12314 vector that had previously been opened by BamHI
and SmaI enzymes and also treated with mung bean nuclease. pRIT12314 contains the entire 2 micron
fragment from S. cerevisiae and an expression cassette in which the TDH3 promoter is separated from the
ARG3 terminator by a BamHI-SmaI-EcoRI linker:



          BamHI     EcoRI
               SmaI
        ATGGATCCCCGGGAATTC
pTDH3                          tARG3
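
The arrangement of the three restriction sites within the linker can be checked by searching the linker sequence for each recognition sequence. The sketch below is illustrative only; the function and variable names are assumptions introduced here, while the recognition sequences used (GGATCC for BamHI, CCCGGG for SmaI, GAATTC for EcoRI) are the standard ones for these enzymes.

```python
LINKER = "ATGGATCCCCGGGAATTC"  # linker sequence as given in the text, 5'->3'

# Standard recognition sequences of the three enzymes named in the text.
RECOGNITION_SITES = {"BamHI": "GGATCC", "SmaI": "CCCGGG", "EcoRI": "GAATTC"}

def find_sites(seq, sites):
    """Return {enzyme: [1-based start positions]} for each recognition sequence."""
    hits = {}
    for enzyme, motif in sites.items():
        positions = []
        start = seq.find(motif)
        while start != -1:
            positions.append(start + 1)          # report 1-based coordinates
            start = seq.find(motif, start + 1)   # allow overlapping matches
        hits[enzyme] = positions
    return hits

site_positions = find_sites(LINKER, RECOGNITION_SITES)
for enzyme, positions in site_positions.items():
    print(enzyme, positions)
```

Each site occurs once, and the sites overlap one another by one nucleotide: the BamHI site begins at the G of the ATG codon (position 3), the SmaI site at position 8 and the EcoRI site at position 13.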



A more detailed description of the construction of pRIT12314 is found in Example 10(B) above. The ligation
mix described above was used to transform E. coli MM294 to ampicillin resistance according to the method of
Cohen et al., cited above. Six transformants containing a plasmid in which the pre S1-pre S2 fragment was
inserted in the correct orientation were retained. These were the plasmids pRIT12843 (a.....f) (Figure 18).

The second step was the insertion into each plasmid pRIT12843 (a.....f) described above of the S coding
region to reconstitute the entire pre S1-pre S2-S gene.

This was done as follows: plasmids pRIT12843 (a.....f) were each digested with BamHI and SalI enzymes. The
BamHI site is located at the beginning of the pre S2 region; the SalI site is located behind the ARG3 terminator in
pBR327. 80 ng (0.012 pmol) of the largest BamHI-SalI fragment (10,400 bp) purified from each plasmid
pRIT12843 (a.....f) were ligated to 300 ng (0.11 pmol) of the small BamHI-SalI fragment from pRIT12660
(Example 16). The small (4800 bp) BamHI-SalI fragment of pRIT12660 contains part of the pre S2-S coding
sequence followed by the ARG terminator sequences and the LEU2 yeast sequence. After ligation and
transformation of E. coli MM294 with each of plasmids pRIT12843 (a.....f) according to the method of Cohen et
al., cited above, a series of ampicillin resistant clones containing the plasmids named pRIT12845 (a.....f) was
obtained. Figure 18 is a flow sheet illustrating preparation of pRIT12845 (a.....f). These plasmids, pRIT12845
(a.....f), were used to individually transform the S. cerevisiae strain 10S44C cir° according to the method of Ito
et al., cited above. The screening of cultures grown from individual clones from the yeast transformations was
done by testing crude cell extracts in the commercially available AUSRIA test (Abbott). Crude cell extracts
were made by disrupting yeast cells resuspended in PBS containing 0.5% Tween 20, 2 mM PMSF and 5%
isopropanol. One transformant of each of the six separate transformations [with pRIT12845 (a.....f)] was
tested. Five out of six tested transformants gave a positive AUSRIA.

To test whether a protein of the correct size gave the S-related antigenicity detected by the AUSRIA, samples
of the crude extracts described above were subjected to the immunoblot procedure described in
Example 20. Four of the five AUSRIA positive transformants showed an S-related protein of about 39K. One
transformant (extract a) showed an S-related protein of 23K. Retained as pRIT12845 was the yeast transformant
expressing the 39K protein present in extract (e). See Figure 18.
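
The molar amounts quoted for the DNA fragments used in the ligations of Example 31 follow from the fragment mass and length. The conversion below is a minimal sketch that assumes an average mass of about 650 g/mol per base pair, a commonly used approximation that is not stated in the source; the function and variable names are likewise illustrative.

```python
AVG_BP_MASS = 650.0  # g/mol per base pair; assumed average, not from the source

def ng_to_pmol(mass_ng, length_bp, bp_mass=AVG_BP_MASS):
    """Convert a mass of double-stranded DNA (ng) to picomoles of molecules."""
    # ng -> g is 1e-9 and mol -> pmol is 1e12, giving a net factor of 1e3.
    return mass_ng * 1000.0 / (length_bp * bp_mass)

# (mass in ng, length in bp) for fragments whose size is given in the text.
fragments = {
    "495 bp NcoI-XbaI fragment": (600, 495),
    "10,400 bp BamHI-SalI fragment": (80, 10400),
    "4800 bp BamHI-SalI fragment": (300, 4800),
}

pmol = {name: ng_to_pmol(ng, bp) for name, (ng, bp) in fragments.items()}
for name, amount in pmol.items():
    print(f"{name}: {amount:.3f} pmol")

# Molar ratio of small to large fragment in the second ligation.
insert_to_vector = pmol["4800 bp BamHI-SalI fragment"] / pmol["10,400 bp BamHI-SalI fragment"]
print(f"small:large molar ratio about {insert_to_vector:.0f}:1")
```

Under this assumption the computed amounts are close to the rounded figures given in the text (2 pmol, 0.012 pmol and 0.11 pmol), and the small fragment is in roughly eight- to nine-fold molar excess in the second ligation.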

Example 32. The preS1-preS2-S protein is assembled into particles carrying a receptor for polymerized human
serum albumin and a preS1 epitope on their surface.

Crude cell extracts from yeast strain 10S44C cir° harboring either pRIT12845, pRIT12660 or pRIT12377 and
yeast strain DC5 cir° harboring pRIT12363 were prepared as described in Example 19. Measurement of total protein
concentration and HBsAg related antigenicity, and detection of a receptor for polymerized human serum albumin,
were performed as described in Example 19. The results showed a strong binding activity for polymerised
human serum albumin present in extracts from strain 10S44C cir° containing pRIT12845 and pRIT12660 (see
also Example 10) which is absent in extracts from strain 10S44C cir° harboring pRIT12377 or strain DC5 cir°
harboring pRIT12363.

CsCl equilibrium centrifugation of crude cell extracts from 10S44C cir° cells containing pRIT12845
(described in Example 12) and measurement of HBsAg related antigenicity by AUSRIA assay in the CsCl
fractions showed the antigen to band around rho = 1.2 g/cm3. This was confirmed by immunoblot analysis of
the CsCl fractions. Pre S1 antibody binding activity in the CsCl fractions was tested by the ELISA assay
described in Example 12. After allowing the antigen to bind to the solid-phase anti-HBsAg, the bound antigen
was incubated with monoclonal antibody MA18/7 (Example 31), which was then detected by biotinylated
anti-mouse Ig, streptavidin-biotinylated horseradish peroxidase complex and chromogen as described in
Example 12.

The pre S1 specific antibody binding activity was found to coequilibrate with the HBsAg related antigenicity
in the CsCl gradient. These results show that the pre S1-pre S2-S protein produced in 10S44C cir° cells
harboring pRIT12845 is assembled into particles like the 22 nm HBsAg particles derived from serum. The
particles expose pre S1 specific sequences as well as a receptor for polymerised human serum albumin on
their surface. From the level of activity of those particles in the AUSRIA test and the intensity of the reaction of
the pre S1-pre S2-S protein in an immunoblot analysis, it can be concluded that the particle's S determinants
are partially obstructed or deformed by the presence of pre S sequences in the particle.

Example 33. Two proteins of molecular weight of about 38 and 45 kilodaltons (kD), both carrying preS1, preS2
and S epitopes, are expressed in yeast transformed with pRIT12845

HBsAg related protein monomers expressed in pRIT12845 containing cells were analysed as described in
Example 20.

Migration and immune reactivity of the cellular proteins were compared to the proteins from HBsAg
particles (serotype ad) purified from human serum (obtained from Dr. W.H. Gerlich, Department of Medical
Microbiology, University of Göttingen, Federal Republic of Germany).

Three immunoreagents were used to analyse the proteins:

- the monoclonal antibody (obtained from H. Thomas, Example 20) recognizing an S epitope

- a polyvalent rabbit antiserum raised against the synthetic pre S2 peptide (described in Example 23)

- monoclonal antibody MA18/7 specific for a pre S1 epitope as described by Heermann et al. (J. Virol., 52,
396-402, 1984).

The immunoblot showed, among the cellular proteins derived from pRIT12845 containing 10S44C cells, the
presence of two proteins reacting specifically with all three immunoreagents described above. The molecular
weight of these two proteins was estimated to be about 38,000 and 45,000 daltons. The molecular weight
estimation was based on the migration of the proteins from serum derived HBsAg as defined by Heermann et al.
(J. Virol., 52, 396-402, 1984). The 38,000 dalton pre S1-pre S2-S protein directed by pRIT12845 migrates slightly
faster than the p39 pre S1-pre S2-S protein derived from HBsAg particles. The p39 protein from human serum
particles of serotype ad presumably has a pre S1 region of 119 amino acids (Heermann et al., J. Virol., 52,
396-402, 1984). pRIT12845 codes for a pre S1 region of 108 amino acids.
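
The difference in migration can be related to the difference in pre S1 length by simple arithmetic. The sketch below assumes an average residue mass of about 110 daltons, a rule-of-thumb value that is not given in the source, and takes the 55-residue length of the pre S2 region from the residue numbering used in this document; the variable names are illustrative.

```python
PRE_S1_YEAST = 108       # pre S1 residues encoded by pRIT12845 (from the text)
PRE_S1_SERUM_P39 = 119   # pre S1 residues presumed for serum p39 (from the text)
PRE_S2 = 55              # residues in the pre S2 region (numbered from -55)
AVG_RESIDUE_MASS_DA = 110.0  # assumed average residue mass, not from the source

# Combined pre S1 + pre S2 length; compare with residue -163 cited for the NcoI site.
pre_s_length = PRE_S1_YEAST + PRE_S2

# Shorter pre S1 region in the yeast protein and the approximate mass it represents.
residue_difference = PRE_S1_SERUM_P39 - PRE_S1_YEAST
mass_difference_kda = residue_difference * AVG_RESIDUE_MASS_DA / 1000.0

print(pre_s_length, residue_difference, round(mass_difference_kda, 1))
```

The 11-residue difference corresponds to roughly 1.2 kD under this assumption, which is in line with the yeast protein migrating slightly faster than serum-derived p39.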

The above results show that yeast cells harboring pRIT12845 express two proteins of about 38 and 45 kD
carrying pre S1, pre S2 as well as S epitopes.

Example 34. The 45 kD preS1-preS2-S protein species carries an N-glycosidically bound high-mannose type
oligosaccharide chain

Cells harboring pRIT12845 were disrupted with an equal weight of glass beads (diameter 0.45-0.50 mm) in
an equal weight of 10 mM sodium phosphate buffer (pH 7.4) containing 2% SDS and 8 mM EDTA by agitation
with a Vortex mixer.

After centrifugation, the supernatant was heated for 4 minutes at 100°C.

10 µl of the heated supernatant fluid was diluted with 40 µl of 25 mM sodium citrate buffer (pH 5.0), and 2 µl of
EndoH (74 µg/ml) (see Example 27) was added. Samples with and without added EndoH were incubated for
2.5 hrs at 37°C.

The EndoH treated and untreated samples were analysed by SDS-polyacrylamide gel electrophoresis and
immunoblotting as described in Example 20.

The blot was analysed with monoclonal antibody 6 (S specific) and MA18/7 (pre S1 specific) (see Example
33).

The immunoblot showed, both with monoclonal 6 and monoclonal MA18/7, the disappearance of the 45 kD
band and the appearance of a 41 kD band upon Endo H treatment. The migration of the 38 kD band was not
affected by the Endo H treatment.

These results show that the 45 kD pre S1-pre S2-S protein species carries an N-glycosidically bound
high-mannose type oligosaccharide chain.

Example 35. The 38 kD preS1-preS2-S protein contains covalently bound myristic acid present in an amide
linkage

Cells from yeast strain 10S44C cir° harboring either pRIT12845, pRIT12660 or pRIT12377 were grown in YNB
to an OD620nm of 0.4.

Cells (20 ml culture) were labeled for 60 min with 1 mCi of 3H-labeled myristic acid (New England Nuclear)
and then collected by centrifugation. Cells were washed with cold H2O, resuspended in 100 µl of 5 mM Tris-HCl
(pH 7.4) containing 3 mM DTT, 1% SDS and 1 mM PMSF, and broken with 100 mg glass beads by six
30-second spurts of vigorous mixing in a Vortex, cooling on ice between each mixing. Debris was removed by
centrifugation for 1 minute in an Eppendorf 5414 centrifuge (Towler and Glaser, Proc. Natl. Acad. Sci., 83,
2812-2816, 1986). 20 µl of the extract was treated for 3.5 hrs at room temperature with 7 µl of freshly prepared 4
M hydroxylamine, 20 mM glycine, pH 10.
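
The final reagent concentrations in the hydroxylamine treatment follow from the volumes mixed. The helper below is a minimal sketch that assumes the volumes are simply additive; the function and variable names are illustrative and do not come from the source.

```python
def final_concentration(stock_conc, stock_volume_ul, sample_volume_ul):
    """Concentration after adding a stock solution to a sample (additive volumes assumed)."""
    return stock_conc * stock_volume_ul / (stock_volume_ul + sample_volume_ul)

# 7 µl of 4 M hydroxylamine, 20 mM glycine added to 20 µl of extract.
hydroxylamine_final_M = final_concentration(4.0, 7.0, 20.0)
glycine_final_mM = final_concentration(20.0, 7.0, 20.0)

print(f"hydroxylamine: {hydroxylamine_final_M:.2f} M")
print(f"glycine: {glycine_final_mM:.1f} mM")
```

Under this assumption the extract is exposed to approximately 1 M hydroxylamine during the 3.5 hour treatment.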

Hydroxylamine treated and untreated samples were subjected to SDS-polyacrylamide gel electrophoresis
and fluorography as described in Example 26. The hydroxylamine untreated samples were also subjected to
immunoblot analysis with the monoclonal antibody recognizing an S epitope, as described in Example 20.
The immunoblot showed results identical to those described in Examples 20 and 33.
The fluorograph showed a labeled band of 38 kD present only in the cellular extract from cells harboring
pRIT12845, which was hydroxylamine-stable, indicating that the 3H-myristate label is present in an amide
linkage. No labeled bands specific for the extracts from cells harboring pRIT12660 were detected.

These results show that the 38 kD pre S1-pre S2-S protein contains covalently bound myristic acid present in an
amide linkage. In general, myristate becomes covalently bound to proteins via amide linkage to an
amino-terminal glycine. This suggests that the initiator Met has been removed from the pre S1-pre S2-S
protein and that the Gly residue at position 2 has become available for acylation with myristic acid.

Example 36. Use of preS1 vectors for insertion of functional DNA coding sequences

The starting material is DNA of plasmid pRIT12793 described in Example 31 above. pRIT12793 contains the
pre S1-pre S2 coding sequence on a 495 bp NcoI-XbaI fragment. DNA of plasmid pRIT12793 is digested with
NcoI restriction endonuclease, treated with T4 DNA polymerase to fill in the sticky ends and digested with XbaI
endonuclease. The NcoI/T4 DNA polymerase/XbaI fragment is purified by polyacrylamide gel electrophoresis
and electroelution and inserted on vector pUC12 previously digested with SmaI and XbaI endonucleases. The
cloning vector pUC12 is commercially available from Amersham (Little Chalfont, Bucks, United Kingdom) and
from Pharmacia (Piscataway, NJ). The plasmid pRITX is obtained as shown in Figure 5B. In the pre S1-pre S2
region of plasmid pRITX, a single NcoI site overlaps the ATG start codon of the pre S1 region, and an XbaI site is
present adjacent to and downstream of the TAA stop codon at the C-terminus of the pre S2 coding
sequences. A single BstX restriction site is present on the plasmid close to the N-terminal end of the pre S1
coding region.

It will be appreciated by one skilled in the art that the vector pRITX or similar derivative vectors containing
the pre S1-pre S2 sequences may be used for introduction of further functional DNA coding sequences encoding
peptides of interest to create in phase fusions within the pre S1 region by insertion at the BstX site.

Furthermore, functional DNA coding sequences may be introduced by digestion of pRITX and derivatives
with combinations of BstX and BamHI or EcoRI or PstI endonucleases and insertion of the functional DNA
coding sequence between these sites, thus removing the greater part of the pre S1 coding region and the
N-terminal part of the pre S2 sequence. Such fusions will be of the type pre S1-introduced functional DNA
coding sequence-pre S2. Once such fusions are made, they can be combined onto other plasmids containing
the rest of the pre S1-pre S2 sequences and sequences necessary for maintenance and replication in E. coli
and S. cerevisiae as exemplified in Example 16A.

Example 37. Fusion of peptides from proteins of HTLV-III virus to the preS2 region of preS2-S sequences to
create novel hybrid particles

The preS2 region of a HBV preS2-S coding sequence can also be used for the insertion or fusion of peptide
sequences of other viral antigens, including those of the causative agent of human AIDS syndrome, the
retrovirus known as HIV, HTLV-III or LAV.

The cloned genome of a HIV virus has been described by Ratner L. et al. (Nature, 313, 277-284, 1985) and
Shaw G. et al. (Science, 226, 1165-1171, 1984). From this genomic clone various subfragments can be
obtained as functional DNA coding sequences which correspond to peptide regions of interest for fusion to
the preS2-S sequence to form novel hybrid particles containing epitopes of the HIV virus proteins. Such hybrid
particles can serve as the basis of a human vaccine for protection against the infective agent HIV.

Among the peptide regions of interest are the following:

a. C7 peptide.

This region corresponds to a stretch of 45 amino acid residues immediately adjacent to the processing site
of the envelope precursor and was defined by Starcich B.R. et al. (Cell, 45, 637-648, 1986).

b. Peptide 121.

This region corresponds to a relatively conserved amino acid sequence present in several retroviral
transmembrane envelope proteins (see Cianciolo G.J. et al., Science, 230, 453-455, 1985). This sequence is
thought to participate in the pathogenesis of retroviral-induced immunosuppression (Snyderman R. and
Cianciolo G.J., Immunol. Today, 5, 240, 1984).

c. 'Dreesman peptide'.

A synthetic peptide corresponding to amino acid sequence 735 to 752 of the precursor envelope
glycoprotein of HIV was used by Kennedy R.C. et al. (Science, 231, 1556-1559, 1986) to immunize rabbits, and
rabbit antiserum was used to investigate the recognition of HIV envelope proteins by induced antibodies. The
results suggest that this region could induce an immune response directed against the native glycoprotein.

A. Fusion of the DNA sequence corresponding to the C7 peptide from a HTLV-III isolate to the preS2-S region
on pRIT10911.

The starting material was plasmid pBH10-R2 containing the cloned genome of HIV isolate BH10. This
plasmid was obtained from R.C. Gallo, National Institutes of Health, Bethesda, MD. A 3108 bp SalI-XhoI
fragment of DNA of viral origin was excised from pBH10-R2 and cloned between the SalI and XhoI sites
of pUC18 to create plasmid pRIT12901. A restriction map of this SalI-XhoI fragment is shown in Figure 1B.
Plasmid pUC18 is described by Norrander J. et al. (Gene, 26, 101-106, 1983) and is commercially available from
Pharmacia Inc., Piscataway, NJ.

To create the fusion, DNA of pRIT12901 was digested with BglII and MboII endonucleases, and the 117 bp
fragment isolated by acrylamide gel electrophoresis and electroelution (see Figure 1B). DNA of this fragment
was treated with mung bean nuclease to remove single stranded extensions and ligated with DNA of
pRIT10911 (see Example 5, above) which had been digested with BamHI endonuclease and treated with mung
bean nuclease. From this ligation mixture a plasmid, pRIT12893, was isolated in which the 117 bp BglII-MboII
fragment from pRIT12901 has been inserted on pRIT10911. The nucleotide sequence across the fusion
junction between the TDH3 promoter and the 117 bp HTLV-III BglII-MboII, mung bean nuclease treated
fragment insert on pRIT12893 was determined and it was found that 10 nucleotides in excess had been
removed from the BglII end of the fragment. The sequence found was

5' ATGGAGGAGGAGATATGAGGGA 3'

where the first ATG codon underscored is that of the TDH3 promoter region from pRIT10911 and the second is
an ATG codon internal to the C7 peptide fragment. The second ATG codon overlaps a TGA termination
codon. This termination codon is in the same reading frame as the first ATG codon. DNA sequence
determination of the 3' end of the fusion region on pRIT12893 showed the correct sequence for in phase fusion
of the MboII-mung bean nuclease treated end of the HTLV-III fragment to the preS2 sequence on pRIT10911 as
shown in Figure 2B.

DNA of pRIT12893 was then digested with HindIII endonuclease and the 3250 bp fragment isolated and
purified by agarose gel electrophoresis and electroelution. The 3250 bp HindIII fragment was then inserted at
the HindIII site of plasmid YEp13 to create pRIT12894. DNA of the plasmid was then used to transform cells of
yeast strains DC5 and 10S44C to leucine independence by the method of Ito et al. as cited in Example 10
above.

Transcription and translation in yeast cells of the C7-preS2-S fusion DNA on pRIT12894 will result in
synthesis of a small 5 residue peptide commencing at the TDH3 ATG codon and terminating at the TGA codon
underscored above and of the C7-preS2-S fusion peptide commencing at the second ATG codon internal to
the C7 sequence. This fusion contains 38 amino acid residues from the C7 peptide in place of the 45 residues
from the intact BglII-MboII-mung bean nuclease treated C7 fragment. It will be appreciated that other plasmids
recovered from the same ligation mixture as pRIT12893 can be examined and sequenced to determine if they
carry a fusion corresponding to the complete fragment.
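
The reading-frame relationships described for the fusion junction of pRIT12893 can be checked directly on the 22-nucleotide junction sequence given in Part A. The following is an illustrative sketch only; the function names are assumptions introduced here, and the three standard stop codons are used.

```python
JUNCTION = "ATGGAGGAGGAGATATGAGGGA"  # junction sequence from the text, 5'->3'
STOP_CODONS = {"TAA", "TAG", "TGA"}

def find_all(seq, motif):
    """Return 0-based start positions of every (possibly overlapping) match."""
    return [i for i in range(len(seq) - len(motif) + 1) if seq.startswith(motif, i)]

def codons_before_stop(seq, start):
    """Count codons read from `start` before the first in-frame stop codon.

    Returns (codon_count, stop_position), or None when no in-frame stop
    codon occurs within the sequence.
    """
    for count, i in enumerate(range(start, len(seq) - 2, 3)):
        if seq[i:i + 3] in STOP_CODONS:
            return count, i
    return None

atg_positions = find_all(JUNCTION, "ATG")
first_orf = codons_before_stop(JUNCTION, atg_positions[0])
second_orf = codons_before_stop(JUNCTION, atg_positions[1])

print(atg_positions, first_orf, second_orf)
```

Reading from the first ATG gives five codons followed by the TGA stop, which starts one nucleotide after the second ATG begins; reading from the second ATG meets no stop codon within the junction sequence, consistent with translation continuing into the C7 fragment.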

B. Fusion of the DNA sequence corresponding to the Peptide 121 to the preS2-S region on pRIT10911.

To create the fusion, DNA of pRIT12901 (described in Part A, above) was digested with HhaI and HindIII
endonucleases to liberate the 230 bp fragment shown in Figure 1B. This fragment was purified by acrylamide
gel electrophoresis and electroelution and treated with T4 DNA polymerase to fill in single strand extensions.
The so treated fragment was ligated with DNA of pRIT10911 which had been digested with BamHI and treated
with mung bean nuclease. From this ligation mixture, a plasmid was identified as pRIT12897 on which the 230 bp fragment
of HIV DNA has been inserted in the correct reading frame for fusion to the preS2 sequence of pRIT10911 as
shown in Figure 3B. DNA of plasmid pRIT12897 was then digested with HindIII endonuclease, and the 3365 bp
fragment containing the TDH3 promoter, HIV-preS2-S fusion and the ARG3 terminator was purified by agarose
gel electrophoresis and electroelution. This 3365 bp fragment was ligated onto YEp13 digested with HindIII
endonuclease to form plasmid pRIT12898. DNA of this plasmid (pRIT12898) was then used to transform cells
of yeast strains DC5 and 10S44C to leucine independence by the method of Ito et al. as cited in Example 10,
above.

C. Fusion of the DNA sequence corresponding to the 'Dreesman peptide' to the preS2 region on pRIT10911.

A 60 bp fragment of synthetic DNA with the sequence shown below was synthesized by conventional means
as two single strands which were then annealed together:

5' GATCTCGACAGGCCCGAAGGAATAGAAGAAGAAGGTGGAGAGAGA 3'
3'     AGCTGTCCGGGCTTCCTTATCTTCTTCTTCCACCTCTCTCT 5'

5' GACAGAGACAGATCCCCG     3'
3' CTGTCTCTGTCTAGGGGCCTAC 5'



This fragment has 5' BglII and 3' BamHI single strand extensions. About 200 ng of the double stranded
fragment and 300 ng of DNA of plasmid pRIT10911 digested with BamHI were mixed and ligated. From this
ligation mixture, a plasmid, pRIT12899, was identified and recovered on which the 60 bp fragment has been
inserted at the BamHI site of pRIT10911 so as to create an in-frame fusion as shown in Figure 4B. DNA of
pRIT12899 was digested with HindIII endonuclease, and the 3200 bp HindIII fragment was purified by agarose
gel electrophoresis and electroelution. This HindIII fragment was inserted at the HindIII site of YEp13 to create
plasmid pRIT12900. DNA of plasmid pRIT12900 was then used to transform cells of yeast strains DC5 and
10S44C to leucine independence by the method of Ito et al. cited in Example 10, above.
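
Whether the synthetic fragment can be inserted without disturbing the downstream reading frame can be examined by translating its upper strand. The sketch below uses the standard genetic code and assumes that the reading frame begins at the first nucleotide of the upper strand as written in Part C; the frame actually used in pRIT12899 is the one defined by Figure 4B, which is not reproduced here. All names in the code are illustrative.

```python
# Standard genetic code, built from the conventional TCAG-ordered table.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Upper strand of the synthetic fragment (both printed segments joined), 5'->3'.
TOP_STRAND = (
    "GATCTCGACAGGCCCGAAGGAATAGAAGAAGAAGGTGGAGAGAGA"
    "GACAGAGACAGATCCCCG"
)

def translate(seq):
    """Translate complete codons of seq, starting at its first nucleotide ('*' = stop)."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))

peptide = translate(TOP_STRAND)
print(len(TOP_STRAND), peptide)
```

In this frame the 63-nucleotide strand reads as 21 complete codons with no stop codon, so an insertion in this frame adds a whole number of codons and leaves the downstream frame unchanged.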

Example 38. Expression of fusion proteins from pRIT12894 and pRIT12898

Yeast strains DC5 or 10S44C harboring either YEp13, pRIT12363, pRIT10912, pRIT12894 or pRIT12898 were
separately grown in liquid medium lacking leucine (YNB + 80 µg/ml histidine or YNB), and cellular proteins
were analyzed by the immunoblot procedure as described in Example 11 using as the first antibody a
monoclonal antibody directed against a denaturation and reduction resistant epitope of human derived HBsAg
(monoclonal 6 obtained from H. Thomas, Royal Free Hospital, London, England). pRIT10912 is described in
Example 5 above. pRIT12894 and pRIT12898 are described in Example 37, above. pRIT12363 carries the S
gene of HBV fused to the TDH3 promoter as the 2900 bp HindIII fragment from pRIT12322 (Example 8) inserted
on a yeast vector plasmid identical to pRIT12377 (Example 10, part B above) and produces HBsAg under the
control of the TDH3 promoter.

The immunoblot showed a protein band related to S protein of estimated molecular weight of about 32 kD
among the total cellular proteins from yeast strains DC5 or 10S44C transformed with pRIT12894. A protein
band related to S protein of estimated molecular weight of about 45 kD was present among the total cellular
proteins from yeast strains DC5 or 10S44C harboring pRIT12898, while a protein band related to S protein of
estimated molecular weight of about 29 kD was present among the total cellular proteins from yeast strain DC5
harboring pRIT10912.

Crude cell extracts from yeast strain DC5 harboring either pRIT12363, pRIT10912, or pRIT12894 were
prepared as described in Example 12. Protein concentration and AUSRIA activity were measured as described
in Example 12. Immunoblot analysis of these extracts was performed as described above.

CsCl equilibrium centrifugation of the crude extracts (described in Example 12) and measurement of
HBsAg related antigenicity by AUSRIA assay in the CsCl fractions showed the antigen to band around
rho = 1.2 g/cm3. This was confirmed by immunoblot analysis of the CsCl fractions.

These results show that the 32 kD fusion protein produced in yeast strain DC5 cells carrying pRIT12894 is
assembled into particles like the 22 nm HBsAg particles derived from human serum. From the level of activity
of those particles in the AUSRIA test, and the intensity of the reaction of the 32 kD protein in the immunoblot
analysis, it can be concluded that the particles' S determinants are partially obstructed or deformed by the
presence of C7 sequences.

From these results it is concluded that yeast cells of strains DC5 and 10S44C transformed with either
pRIT12894 or pRIT12898 specifically synthesize a fusion protein of molecular weight of about 32 kD or 45 kD
respectively, both carrying an S epitope.

It will be appreciated by one of skill in the art that hybrid particles carrying the HIV virus peptide epitopes can
be extracted and purified by conventional means from cultures of yeast cells transformed with pRIT12894,
pRIT12898 and pRIT12900. Such purified particles may then be formulated in the presence of adjuvants such
as Al(OH)3 or AlPO4 as vaccines for human use. The preferred dose will be in the range of 1-1000 µg protein
and is determined by conventional trials. It will also be understood that the invention as exemplified herein
includes other peptide regions from HIV viral proteins which can be fused to the preS2-S region of HBV coding
sequences to form hybrid particles presenting the HIV epitope of interest.



Claims 



1. A recombinant DNA molecule comprising a functional DNA coding sequence fused, in phase, to a
portion of the Pre S2 region of a HBV Pre S2-S protein coding sequence wherein the functional DNA
coding sequence comprises the PreS2 coding sequence, PreS1 coding sequence or entire PreS1-PreS2
coding sequence, the circumsporozoite protein coding sequence of Plasmodium, or a HIV coding
sequence such as a HIV envelope gene sequence such as the HIV C7 protein coding region, the HIV
Peptide 121 coding region or the HIV Dreesman peptide coding region.

2. A recombinant vector which comprises a recombinant DNA molecule operatively linked to a
regulatory region, wherein the DNA molecule contains a functional DNA coding sequence fused, in phase,
to a portion of the Pre S2 region of a HBV Pre S2-S protein coding sequence wherein the regulatory
region comprises the TDH3 promoter.

3. A recombinant vector which comprises a recombinant DNA molecule operatively linked to a
regulatory region, wherein the DNA molecule contains a functional DNA coding sequence fused, in phase,
to a portion of the Pre S2 region of a HBV Pre S2-S protein coding sequence wherein the functional DNA
coding sequence comprises the PreS2 coding sequence, PreS1 coding sequence or entire PreS1-PreS2
coding sequence, circumsporozoite (CS) protein coding sequence of Plasmodium, or a HIV coding
sequence such as a HIV envelope gene sequence such as the HIV C7 protein, the HIV Peptide 121 protein
or the HIV Dreesman peptide.

4. The vector of Claim 3 which comprises a 192 bp Sau3A fragment coding for 16 tetrapeptide repeats
of the CS protein of P. falciparum wherein such fragment is one which could be derived from Sau3A
digestion of a 1215 bp StuI-RsaI fragment of plasmid WR201 containing the P. falciparum CS protein
coding sequence minus the first 52 bp of such sequence.

5. The vector of Claim 4 which comprises additional Sau3A fragments of CS protein DNA.

6. A recombinant vector which comprises a recombinant DNA molecule operatively linked to a
regulatory region, wherein the DNA molecule contains a functional DNA coding sequence fused, in phase,
to a portion of the Pre S2 region of a HBV Pre S2-S protein coding sequence wherein the functional DNA
coding sequence comprises all or part of the Pre S2 protein coding sequence.

7. The vector of Claim 6 wherein the functional DNA coding sequence comprises the following amino
acid coding sequence:

PRE-S2 REGION
-55

ATG CAG TGG AAT TCC ACT GCC TTC CAC CAA
Met Gln Trp Asn Ser Thr Ala Phe His Gln

GCT CTG CAG GAT CCC AGA GTC AGG GGT CTG
Ala Leu Gln Asp Pro Arg Val Arg Gly Leu

TAT TTT CCT GCT GGT GGC TCC AGT TCA GGA
Tyr Phe Pro Ala Gly Gly Ser Ser Ser Gly

ACA GTA AAC CCT GCT CCG AAT ATT GCC TCT
Thr Val Asn Pro Ala Pro Asn Ile Ala Ser

CAC ATA TCG TCA AGC TCC GCG AGG ACT GGG
His Ile Ser Ser Ser Ser Ala Arg Thr Gly

GAC CCT GTG ACG AAC
Asp Pro Val Thr Asn.

8. The vector of Claim 6 wherein the functional DNA coding sequence comprises all or part of the Pre 
S1-PreS2 protein coding sequence. 

9. The vector of Claim 8 wherein the functional DNA coding sequence comprises the following amino
acid coding sequence:

PRE-S1 REGION
-163

ATG GGG ACG AAT CTT TCT GTT CCC AAC CCT CTG
Met Gly Thr Asn Leu Ser Val Pro Asn Pro Leu

GGA TTC TTT CCC GAT CAT CAG TTG GAC CCT
Gly Phe Phe Pro Asp His Gln Leu Asp Pro

GCA TTC GGA GCC AAC TCA AAC AAT CCA GAT
Ala Phe Gly Ala Asn Ser Asn Asn Pro Asp

TGG GAC TTC AAC CCC ATC AAG GAC CAC TGG
Trp Asp Phe Asn Pro Ile Lys Asp His Trp

CCA GCA GCC AAC CAG GTA GGA GTG GGA GCA
Pro Ala Ala Asn Gln Val Gly Val Gly Ala

TTC GGG CCA GGG CTC ACC CCT CCA CAC GGC
Phe Gly Pro Gly Leu Thr Pro Pro His Gly

GGT ATT TTG GGG TGG AGC CCT CAG GCT CAG
Gly Ile Leu Gly Trp Ser Pro Gln Ala Gln

GGC ATA TTG ACC ACA GTG TCA ACA ATT CCT
Gly Ile Leu Thr Thr Val Ser Thr Ile Pro

CCT CCT GCC TCC ACC AAT CGG CAG TCA GGA
Pro Pro Ala Ser Thr Asn Arg Gln Ser Gly

AGG CAG CCT ACT CCC ATC TCT CCA CCT CTA
Arg Gln Pro Thr Pro Ile Ser Pro Pro Leu

AGA GAC AGT CAT CCT CAG GCC
Arg Asp Ser His Pro Gln Ala

PRE-S2 REGION
-55

ATG CAG TGG AAT TCC ACT GCC TTC CAC CAA
Met Gln Trp Asn Ser Thr Ala Phe His Gln

GCT CTG CAG GAT CCC AGA GTC AGG GGT CTG
Ala Leu Gln Asp Pro Arg Val Arg Gly Leu

TAT TTT CCT GCT GGT GGC TCC AGT TCA GGA
Tyr Phe Pro Ala Gly Gly Ser Ser Ser Gly

ACA GTA AAC CCT GCT CCG AAT ATT GCC TCT
Thr Val Asn Pro Ala Pro Asn Ile Ala Ser

CAC ATA TCG TCA AGC TCC GCG AGG ACT GGG
His Ile Ser Ser Ser Ser Ala Arg Thr Gly

GAC CCT GTG ACG AAC
Asp Pro Val Thr Asn.
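
The position labels -163 and -55 in this listing are consistent with the codon counts of the two regions. The Python sketch below counts codons and derives the first and last residue number of each region. It assumes that the negative labels number amino acid residues, one per codon, ending at -1 immediately before the S protein; that reading and the helper names are assumptions made for illustration.

```python
# Illustrative sketch: sequences transcribed from the listing above.
PRE_S1_DNA = """
ATG GGG ACG AAT CTT TCT GTT CCC AAC CCT CTG
GGA TTC TTT CCC GAT CAT CAG TTG GAC CCT
GCA TTC GGA GCC AAC TCA AAC AAT CCA GAT
TGG GAC TTC AAC CCC ATC AAG GAC CAC TGG
CCA GCA GCC AAC CAG GTA GGA GTG GGA GCA
TTC GGG CCA GGG CTC ACC CCT CCA CAC GGC
GGT ATT TTG GGG TGG AGC CCT CAG GCT CAG
GGC ATA TTG ACC ACA GTG TCA ACA ATT CCT
CCT CCT GCC TCC ACC AAT CGG CAG TCA GGA
AGG CAG CCT ACT CCC ATC TCT CCA CCT CTA
AGA GAC AGT CAT CCT CAG GCC
"""

PRE_S2_DNA = """
ATG CAG TGG AAT TCC ACT GCC TTC CAC CAA
GCT CTG CAG GAT CCC AGA GTC AGG GGT CTG
TAT TTT CCT GCT GGT GGC TCC AGT TCA GGA
ACA GTA AAC CCT GCT CCG AAT ATT GCC TCT
CAC ATA TCG TCA AGC TCC GCG AGG ACT GGG
GAC CCT GTG ACG AAC
"""


def codons(dna):
    """Split a whitespace-formatted coding sequence into codons."""
    seq = "".join(dna.split())
    return [seq[i:i + 3] for i in range(0, len(seq), 3)]


def residue_span(first_position, dna):
    """Return (first, last) residue numbers, counting one residue per codon."""
    return first_position, first_position + len(codons(dna)) - 1


PRE_S1_SPAN = residue_span(-163, PRE_S1_DNA)
PRE_S2_SPAN = residue_span(-55, PRE_S2_DNA)
TOTAL_BP = 3 * (len(codons(PRE_S1_DNA)) + len(codons(PRE_S2_DNA)))

if __name__ == "__main__":
    print(PRE_S1_SPAN, PRE_S2_SPAN, TOTAL_BP)
```

Under these assumptions the Pre S1 region ends at -56, directly adjacent to the first Pre S2 residue at -55.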



10. A eukaryotic host cell transformed with a recombinant vector, wherein said vector is according to
Claim 5.

11. The host of Claim 10 which is a yeast cell.

12. A method of preparing a hybrid particle containing HBsAg protein which contains and/or presents a
peptide encoded by a functional DNA coding sequence which comprises culturing a eukaryotic host cell
transformed with a vector in appropriate culture media and isolating the particle from a cell lysate or
extract of such host's culture, wherein said vector is according to Claim 5.

presents a peptide encoded by a functional DNA coding sequence.

15. A vaccine comprising an immunoprotective amount of a particle according to Claim 14.

16. A micelle comprising the particle according to Claim 14 and polysorbate.

17. A vaccine comprising an immunoprotective amount of a micelle according to Claim 16.

18. A HBV Pre S1 protein coding region which codes for the following amino acid sequence:

-163

ATG GGG ACG AAT CTT TCT GTT CCC AAC CCT CTG
Met Gly Thr Asn Leu Ser Val Pro Asn Pro Leu

GGA TTC TTT CCC GAT CAT CAG TTG GAC CCT
Gly Phe Phe Pro Asp His Gln Leu Asp Pro

GCA TTC GGA GCC AAC TCA AAC AAT CCA GAT
Ala Phe Gly Ala Asn Ser Asn Asn Pro Asp

TGG GAC TTC AAC CCC ATC AAG GAC CAC TGG
Trp Asp Phe Asn Pro Ile Lys Asp His Trp

CCA GCA GCC AAC CAG GTA GGA GTG GGA GCA
Pro Ala Ala Asn Gln Val Gly Val Gly Ala

TTC GGG CCA GGG CTC ACC CCT CCA CAC GGC
Phe Gly Pro Gly Leu Thr Pro Pro His Gly

GGT ATT TTG GGG TGG AGC CCT CAG GCT CAG
Gly Ile Leu Gly Trp Ser Pro Gln Ala Gln

GGC ATA TTG ACC ACA GTG TCA ACA ATT CCT
Gly Ile Leu Thr Thr Val Ser Thr Ile Pro

CCT CCT GCC TCC ACC AAT CGG CAG TCA GGA
Pro Pro Ala Ser Thr Asn Arg Gln Ser Gly

AGG CAG CCT ACT CCC ATC TCT CCA CCT CTA
Arg Gln Pro Thr Pro Ile Ser Pro Pro Leu

AGA GAC AGT CAT CCT CAG GCC
Arg Asp Ser His Pro Gln Ala

linked to a regulatory region.

21. Protein encoded by the Pre S1 protein coding region of Claim 18.
22. A vaccine comprising an immunoprotective amount of the protein encoded by the Pre S1 protein

23. A HBV Pre S2 protein coding region or a HBV Pre S2-S protein coding region wherein the Pre S2
portion thereof codes for the following amino acid sequence:

Thr 

-55 -50 or

Met-Gln-Trp-Asn-Ser-Thr-Ala-Phe-His-Gln-Ala-Leu-Gln-Asp-

-40 -30
Pro-Arg-Val-Arg-Gly-Leu-Tyr-Phe-Pro-Ala-Gly-Gly-Ser-Ser- 

Ser- 

-20 

Gly-Thr-Val-Asn-Pro-Ala-Pro-Asn-Ile-Ala-Ser-His-Ile-Ser- 
-10 

Ser-Ser-Ser-Ala-Arg-Thr-Gly-Asp-Pro-Val-Thr-Asn 

24. A recombinant DNA vector comprising the Pre S2 or the Pre S2-S protein coding region of Claim 23
^r^ 

Protein encoded by the Pre S2 protein coding region of Claim 23.

29. A vaccine comprising an immunoprotective amount of a particle encoded by the Pre S2-S protein
coding region of Claim 23.

30. A micelle containing the particle of Claim 28 and polysorbate.

31. A vaccine comprising an immunoprotective amount of the micelle of Claim 30.

32. A recombinant DNA vector comprising a Pre S1-Pre S2-S protein coding sequence operatively linked
to a regulatory region wherein the Pre S1-Pre S2 portion of such sequence codes for the following
amino acid coding sequence:

PRE-S1 REGION
-163

ATG GGG ACG AAT CTT TCT GTT CCC AAC CCT CTG
Met Gly Thr Asn Leu Ser Val Pro Asn Pro Leu

GGA TTC TTT CCC GAT CAT CAG TTG GAC CCT
Gly Phe Phe Pro Asp His Gln Leu Asp Pro

GCA TTC GGA GCC AAC TCA AAC AAT CCA GAT
Ala Phe Gly Ala Asn Ser Asn Asn Pro Asp

TGG GAC TTC AAC CCC ATC AAG GAC CAC TGG
Trp Asp Phe Asn Pro Ile Lys Asp His Trp

CCA GCA GCC AAC CAG GTA GGA GTG GGA GCA
Pro Ala Ala Asn Gln Val Gly Val Gly Ala

TTC GGG CCA GGG CTC ACC CCT CCA CAC GGC
Phe Gly Pro Gly Leu Thr Pro Pro His Gly

GGT ATT TTG GGG TGG AGC CCT CAG GCT CAG
Gly Ile Leu Gly Trp Ser Pro Gln Ala Gln

GGC ATA TTG ACC ACA GTG TCA ACA ATT CCT
Gly Ile Leu Thr Thr Val Ser Thr Ile Pro

CCT CCT GCC TCC ACC AAT CGG CAG TCA GGA
Pro Pro Ala Ser Thr Asn Arg Gln Ser Gly

AGG CAG CCT ACT CCC ATC TCT CCA CCT CTA
Arg Gln Pro Thr Pro Ile Ser Pro Pro Leu

AGA GAC AGT CAT CCT CAG GCC
Arg Asp Ser His Pro Gln Ala

PRE-S2 REGION
-55

ATG CAG TGG AAT TCC ACT GCC TTC CAC CAA
Met Gln Trp Asn Ser Thr Ala Phe His Gln

GCT CTG CAG GAT CCC AGA GTC AGG GGT CTG
Ala Leu Gln Asp Pro Arg Val Arg Gly Leu

TAT TTT CCT GCT GGT GGC TCC AGT TCA GGA
Tyr Phe Pro Ala Gly Gly Ser Ser Ser Gly

ACA GTA AAC CCT GCT CCG AAT ATT GCC TCT
Thr Val Asn Pro Ala Pro Asn Ile Ala Ser

CAC ATA TCG TCA AGC TCC GCG AGG ACT GGG
His Ile Ser Ser Ser Ser Ala Arg Thr Gly

GAC CCT GTG ACG AAC
Asp Pro Val Thr Asn



33. A yeast host cell transformed with the vector of Claim 32.

34. Protein prepared by the PreS1-PreS2-S sequence of the vector of Claim 32.

35. A vaccine comprising an immunoprotective amount of the protein of Claim 34.

36. A particle comprising protein encoded by a PreS1-PreS2-S protein coding region.

37. A vaccine comprising an immunoprotective amount of the particle of Claim 36.

38. A micelle comprising the particle of Claim 36 and polysorbate.

39. A vaccine comprising an immunoprotective amount of the micelle of Claim 38.
40. A Pre S1-Pre S2 protein coding sequence which comprises the following amino acid sequence:

Pre S1 Region
-163

ATG GGG ACG AAT CTT TCT GTT CCC AAC CCT CTG
Met Gly Thr Asn Leu Ser Val Pro Asn Pro Leu

GGA TTC TTT CCC GAT CAT CAG TTG GAC CCT
Gly Phe Phe Pro Asp His Gln Leu Asp Pro

GCA TTC GGA GCC AAC TCA AAC AAT CCA GAT
Ala Phe Gly Ala Asn Ser Asn Asn Pro Asp

TGG GAC TTC AAC CCC ATC AAG GAC CAC TGG
Trp Asp Phe Asn Pro Ile Lys Asp His Trp

CCA GCA GCC AAC CAG GTA GGA GTG GGA GCA
Pro Ala Ala Asn Gln Val Gly Val Gly Ala

TTC GGG CCA GGG CTC ACC CCT CCA CAC GGC
Phe Gly Pro Gly Leu Thr Pro Pro His Gly

GGT ATT TTG GGG TGG AGC CCT CAG GCT CAG
Gly Ile Leu Gly Trp Ser Pro Gln Ala Gln

GGC ATA TTG ACC ACA GTG TCA ACA ATT CCT
Gly Ile Leu Thr Thr Val Ser Thr Ile Pro

CCT CCT GCC TCC ACC AAT CGG CAG TCA GGA
Pro Pro Ala Ser Thr Asn Arg Gln Ser Gly

AGG CAG CCT ACT CCC ATC TCT CCA CCT CTA
Arg Gln Pro Thr Pro Ile Ser Pro Pro Leu

AGA GAC AGT CAT CCT CAG GCC
Arg Asp Ser His Pro Gln Ala

Pre S2 Region
-55

ATG CAG TGG AAT TCC ACT GCC TTC CAC CAA
Met Gln Trp Asn Ser Thr Ala Phe His Gln

GCT CTG CAG GAT CCC AGA GTC AGG GGT CTG
Ala Leu Gln Asp Pro Arg Val Arg Gly Leu

TAT TTT CCT GCT GGT GGC TCC AGT TCA GGA
Tyr Phe Pro Ala Gly Gly Ser Ser Ser Gly

ACA GTA AAC CCT GCT CCG AAT ATT GCC TCT
Thr Val Asn Pro Ala Pro Asn Ile Ala Ser

CAC ATA TCG TCA AGC TCC GCG AGG ACT GGG
His Ile Ser Ser Ser Ser Ala Arg Thr Gly

GAC CCT GTG ACG AAC
Asp Pro Val Thr Asn

41. A recombinant DNA vector comprising the Pre S1-Pre S2 protein coding sequence of Claim 40
operatively linked to a regulatory region.

42. A yeast host cell transformed with the vector of Claim 41.

43. Protein encoded by the Pre S1-Pre S2 protein coding region of Claim 40.

44. A vaccine comprising an immunoprotective amount of protein encoded by the Pre S1-Pre S2 protein
coding region of Claim 40.

45. A recombinant DNA vector comprising a Pre S1 functional DNA coding sequence - Pre S2-S protein
coding sequence operatively linked to an expression control sequence.

46. A yeast host cell transformed with the vector of Claim 45.

47. Protein expressed by the host of Claim 46.

48. A vaccine comprising an immunoprotective amount of the protein of Claim 47.

Claims for the following contracting states: ES, GR

1. A method of preparing a transformed eukaryotic host cell which comprises transforming a eukaryotic
host cell with a vector which comprises a recombinant DNA molecule operatively linked to a regulatory
region containing a functional DNA coding sequence fused, in phase, to a portion of the PreS2 region of a
HBV PreS2-S protein sequence wherein the functional DNA coding sequence also encodes the
circumsporozoite (CS) protein of Plasmodium, or a HIV envelope gene sequence such as the HIV C7
protein, the HIV peptide 121 protein or the HIV Dreesman peptide.

2. The method of Claim 1 wherein the host is a yeast cell.

3. The method of Claim 2 wherein the host belongs to the genus Saccharomyces.

4. The method of Claim 3 wherein the host belongs to the species Saccharomyces cerevisiae or

5. The method of Claim 4 wherein the host is strain DC5, DC5 cir°, 10S44C or 10S44C cir°.

6. A method of preparing a hybrid particle containing HBsAg protein which contains and/or presents a
peptide encoded by a functional DNA coding sequence which comprises culturing a eukaryotic host cell
transformed with a vector in appropriate culture media and isolating the particle from a cell lysate or
extract of such host's culture, wherein said vector is a vector which comprises a recombinant DNA
molecule operatively linked to a regulatory region containing a functional DNA coding sequence fused, in
phase, to a portion of the PreS2 region of a HBV Pre S2-S protein sequence wherein the functional DNA
coding sequence also encodes the circumsporozoite (CS) protein of Plasmodium, or a HIV envelope
gene sequence such as the HIV C7 protein, the HIV peptide 121 protein or the HIV Dreesman peptide.

7. A process of preparing a transformed host cell which comprises transforming a host cell with the
HBV Pre S1 protein coding region which codes for the following amino acid sequence:

-163

ATG GGG ACG AAT CTT TCT GTT CCC AAC CCT CTG
Met Gly Thr Asn Leu Ser Val Pro Asn Pro Leu

GGA TTC TTT CCC GAT CAT CAG TTG GAC CCT
Gly Phe Phe Pro Asp His Gln Leu Asp Pro

GCA TTC GGA GCC AAC TCA AAC AAT CCA GAT
Ala Phe Gly Ala Asn Ser Asn Asn Pro Asp

TGG GAC TTC AAC CCC ATC AAG GAC CAC TGG
Trp Asp Phe Asn Pro Ile Lys Asp His Trp

CCA GCA GCC AAC CAG GTA GGA GTG GGA GCA
Pro Ala Ala Asn Gln Val Gly Val Gly Ala

TTC GGG CCA GGG CTC ACC CCT CCA CAC GGC
Phe Gly Pro Gly Leu Thr Pro Pro His Gly

GGT ATT TTG GGG TGG AGC CCT CAG GCT CAG
Gly Ile Leu Gly Trp Ser Pro Gln Ala Gln

GGC ATA TTG ACC ACA GTG TCA ACA ATT CCT
Gly Ile Leu Thr Thr Val Ser Thr Ile Pro

CCT CCT GCC TCC ACC AAT CGG CAG TCA GGA
Pro Pro Ala Ser Thr Asn Arg Gln Ser Gly

AGG CAG CCT ACT CCC ATC TCT CCA CCT CTA
Arg Gln Pro Thr Pro Ile Ser Pro Pro Leu

AGA GAC AGT CAT CCT CAG GCC
Arg Asp Ser His Pro Gln Ala



8. The process of Claim 7 wherein the region is comprised by a recombinant DNA vector.

9. The process of Claim 7 wherein the host is a yeast cell.

10. The process of Claim 9 wherein the host belongs to the genus Saccharomyces.

11. A process for preparing the protein encoded by the Pre S1 protein coding region of Claim 7 which
comprises culturing a host cell transformed with the region in appropriate culture media and isolating the
protein from a cell lysate or extract of such host's culture.

12. A process of preparing a transformed host cell which comprises transforming a host cell with a HBV
PreS2 protein coding region or a HBV PreS2-S protein coding region wherein the PreS2 portion thereof
codes for the following amino acid sequence:

Thr 

-55 -50 or

Met-Gln-Trp-Asn-Ser-Thr-Ala-Phe-His-Gln-Ala-Leu-Gln-Asp-

-40 -30
Pro-Arg-Val-Arg-Gly-Leu-Tyr-Phe-Pro-Ala-Gly-Gly-Ser-Ser-

Ser-

-20

Gly-Thr-Val-Asn-Pro-Ala-Pro-Asn-Ile-Ala-Ser-His-Ile-Ser-
-10

Ser-Ser-Ser-Ala-Arg-Thr-Gly-Asp-Pro-Val-Thr-Asn



13. The process of Claim 12 wherein the region is comprised by a recombinant DNA vector.

14. A process for preparing protein encoded by the Pre S2 or the Pre S2-S protein coding region of
Claim 12 which comprises culturing a host cell transformed with the region in appropriate culture media
and isolating the protein from a cell lysate or extract of such host's culture.

15. A process for preparing a particle encoded by the Pre S2-S protein coding region of Claim 12 which
comprises culturing a host cell transformed with the region in appropriate culture media and isolating the
protein from a cell lysate or extract of such host's culture.

16. A method of preparing a transformed yeast host cell which comprises transforming a yeast host cell
with a recombinant DNA vector comprising a PreS1-PreS2-S protein coding sequence operatively linked
to a regulatory region.

17. A method of preparing protein encoded by the Pre S1-Pre S2-S protein coding sequence which
comprises culturing a yeast host cell transformed with a recombinant DNA vector comprising a
PreS1-PreS2-S protein coding sequence operatively linked to a regulatory region in appropriate culture
media and isolating the protein from a cell lysate or extract of such host's culture.

18. A method of preparing a particle containing polypeptide encoded by the Pre S1-Pre S2-S protein
coding sequence which comprises culturing a yeast host cell transformed with a recombinant DNA vector
comprising a PreS1-PreS2-S protein coding sequence operatively linked to a regulatory region in
appropriate culture media and isolating the particle from a cell lysate or extract of such host's culture.

19. A method of preparing a transformed yeast host cell which comprises transforming a yeast host cell
with a recombinant DNA vector comprising the following PreS1-PreS2 protein coding sequence operatively
linked to a regulatory region:

Pre S1 Region
-163

ATG GGG ACG AAT CTT TCT GTT CCC AAC CCT CTG
Met Gly Thr Asn Leu Ser Val Pro Asn Pro Leu

GGA TTC TTT CCC GAT CAT CAG TTG GAC CCT
Gly Phe Phe Pro Asp His Gln Leu Asp Pro

GCA TTC GGA GCC AAC TCA AAC AAT CCA GAT
Ala Phe Gly Ala Asn Ser Asn Asn Pro Asp

TGG GAC TTC AAC CCC ATC AAG GAC CAC TGG
Trp Asp Phe Asn Pro Ile Lys Asp His Trp

CCA GCA GCC AAC CAG GTA GGA GTG GGA GCA
Pro Ala Ala Asn Gln Val Gly Val Gly Ala

TTC GGG CCA GGG CTC ACC CCT CCA CAC GGC
Phe Gly Pro Gly Leu Thr Pro Pro His Gly

GGT ATT TTG GGG TGG AGC CCT CAG GCT CAG
Gly Ile Leu Gly Trp Ser Pro Gln Ala Gln

GGC ATA TTG ACC ACA GTG TCA ACA ATT CCT
Gly Ile Leu Thr Thr Val Ser Thr Ile Pro

CCT CCT GCC TCC ACC AAT CGG CAG TCA GGA
Pro Pro Ala Ser Thr Asn Arg Gln Ser Gly

AGG CAG CCT ACT CCC ATC TCT CCA CCT CTA
Arg Gln Pro Thr Pro Ile Ser Pro Pro Leu

AGA GAC AGT CAT CCT CAG GCC
Arg Asp Ser His Pro Gln Ala

Pre S2 Region
-55

ATG CAG TGG AAT TCC ACT GCC TTC CAC CAA
Met Gln Trp Asn Ser Thr Ala Phe His Gln

GCT CTG CAG GAT CCC AGA GTC AGG GGT CTG
Ala Leu Gln Asp Pro Arg Val Arg Gly Leu

TAT TTT CCT GCT GGT GGC TCC AGT TCA GGA
Tyr Phe Pro Ala Gly Gly Ser Ser Ser Gly

ACA GTA AAC CCT GCT CCG AAT ATT GCC TCT
Thr Val Asn Pro Ala Pro Asn Ile Ala Ser

CAC ATA TCG TCA AGC TCC GCG AGG ACT GGG
His Ile Ser Ser Ser Ser Ala Arg Thr Gly

GAC CCT GTG ACG AAC
Asp Pro Val Thr Asn

20. A process for preparing protein encoded by the Pre S1-Pre S2 protein coding region of Claim 19 
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Construction of pRIT1266 
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mPfl mPf15 
AAAAAAGAAAATTATAAATAAATATATATATTCGTGTAAAAAT 

mPf5/8 

50 mPf13 
AAGTAGAAACCACGTATATTATAAATTACAATTC ATG ATG 

Met Met 

100 

tot GTT TCT 
Val Ser 

TTA TTC 
Leu Phe 



AGA 
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TTA 


GCT 
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He 
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r 
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ATG 


AAT 


Tyr 


Asn 
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Glu 
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Gly Glu 

AAT GGA 

Asn Gly 

350 

GAT GAA 

Asp Glu 



Fig. 3A a

AAC GAG AAA TTA AGG AAA CCA AAA CAT
Asn Glu Lys Leu Arg Lys Pro Lys His

REGION I
AAA AAA TTA AAG CAA CCA GGG GAT GGT
Lys Lys Leu Lys Gln Pro Gly Asp Gly

450
AAT CCT GAT CCA AAT GCA A CCA AAT
Asn Pro Asp Pro Asn Ala Asn Pro Asn

GTA GAT CCC AAT GCC AAC CCA AAT GTA
Val Asp Pro Asn Ala Asn Pro Asn Val

500
GAT CCA AAT GCA AAC CCA AAT GTA GAT
Asp Pro Asn Ala Asn Pro Asn Val Asp

CCA AAT GCA AAC CCA AAT GCA AAC CCA
Pro Asn Ala Asn Pro Asn Ala Asn Pro

550
AAT GCA AAC CCA AAT GCA AAC CCA AAT
Asn Ala Asn Pro Asn Ala Asn Pro Asn

GCA AAC CCA AAT GCA AAC CCA AAT GCA
Ala Asn Pro Asn Ala Asn Pro Asn Ala

600
AAC CCA AAT GCA AAC CCA AAT GCA AAC
Asn Pro Asn Ala Asn Pro Asn Ala Asn

CCC AAT GCA AAT CCT AAT GCA AAT CCT
Pro Asn Ala Asn Pro Asn Ala Asn Pro

650
AAT GCA AAC CCA AAT GCA AAT CCT AAT
Asn Ala Asn Pro Asn Ala Asn Pro Asn

700
GCA AAC CCA AAT GCA AAC CCA AAC GTA
Ala Asn Pro Asn Ala Asn Pro Asn Val

GAT CCT AAT GCA AAT CCA AAT GCA AAC
Asp Pro Asn Ala Asn Pro Asn Ala Asn

Fig. 3A b

750
CCA AAT GCA AAC CCA AAC GCA AAC CCC
Pro Asn Ala Asn Pro Asn Ala Asn Pro

AAT GCA AAT CCT AAT GCA AAC CCC AAT
Asn Ala Asn Pro Asn Ala Asn Pro Asn

800
GCA AAT CCT AAT GCA AAT CCT AAT GCC
Ala Asn Pro Asn Ala Asn Pro Asn Ala

AAT CCA AAT GCA AAT CCA AAT GCA AAC
Asn Pro Asn Ala Asn Pro Asn Ala Asn

850
CCA AAC GCA AAC CCC AAT GCA AAT CCT
Pro Asn Ala Asn Pro Asn Ala Asn Pro

AAT GCC AAT CCA AAT GCA AAT CCA AAT
Asn Ala Asn Pro Asn Ala Asn Pro Asn

900
GCA AAC CCA AAT GCA AAC CCA AAT GCA
Ala Asn Pro Asn Ala Asn Pro Asn Ala

AAC CCC AAT GCA AAT CCT AAT AAA AAC
Asn Pro Asn Ala Asn Pro Asn Lys Asn

950
AAT CAA GGT AAT GGA CAA GGT CAC AAT
Asn Gln Gly Asn Gly Gln Gly His Asn

1000
ATG CCA AAT GAC CCA AAC CGA AAT GTA
Met Pro Asn Asp Pro Asn Arg Asn Val

GAT GAA AAT GCT AAT GCC AAC AAT GCT
Asp Glu Asn Ala Asn Ala Asn Asn Ala

1050
GTA AAA AAT AAT AAT AAC GAA GAA CCA
Val Lys Asn Asn Asn Asn Glu Glu Pro

AGT GAT AAG CAC ATA GAA CAA TAT TTA
Ser Asp Lys His Ile Glu Gln Tyr Leu

Fig. 3A c









1100
AAG AAA ATA AAA AAT TCT ATT TCA ACT
Lys Lys Ile Lys Asn Ser Ile Ser Thr

REGION II
GAA TGG TCC CCA TGT AGT GTA ACT TGT
Glu Trp Ser Pro Cys Ser Val Thr Cys

GGA AAT GGT ATT CAA GTT AGA ATA AAG
Gly Asn Gly Ile Gln Val Arg Ile Lys

CCT GGC TCT GCT AAT AAA CCT AAA GAC
Pro Gly Ser Ala Asn Lys Pro Lys Asp

1200
GAA TTA GAT TAT GAA AAT GAT ATT GAA
Glu Leu Asp Tyr Glu Asn Asp Ile Glu

1250
AAA AAA ATT TGT AAA ATG GAA AAA TGT
Lys Lys Ile Cys Lys Met Glu Lys Cys

TCC AGT GTG TTT AAT GTC GTA AAT AGT
Ser Ser Val Phe Asn Val Val Asn Ser

1300
TCA ATA GGA TTA ATA ATG GTA TTA TCC
Ser Ile Gly Leu Ile Met Val Leu Ser

TTC TTG TTC CTT AAT TAG ATAAAGAACACAT
Phe Leu Phe Leu Asn

1350

CTTAGTTTGAGTTGTACAATATTTATAAAAATATATACTACTTT

1400

TTTTCTTAATTTTCATTTTTCTTTATATTTTCCGAGGGAATTTAT

1450

TTTTTTGTGAATATTTAATTACGTTTGCGATTAATTGTAGAAAT

1500

ATATATGTATATACTATATTTATAGAATGTGTTATTCTCAAAAA

Fig. 3A d
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1550 

CAACAACAAAAAAAAAAAAAAAAAAAAAAAAAAGAAAAAAGG 

ATT AA MGTAAAAT AGTTAT AAAT ATTTTC AAAAAT ATTTATAA 

1600 

CACAAAAAATACTTCGAAGTTCATTTAACATTTTTGTTTATTT

1650 

ATTTATTTATATATTTCATTTTTACGTATTTATATTATAAAATG 

1700 

GTGTATCTTAAAAATAGTG AACTATATAT ATAAAATATTAATT 

1 750 

T A AAAAAATTAT AACTTTCTTTTTA TTTTCTAAAAT AACTT AAA 

1800 

AATTATATGTTTAAG AA AGGGGTAAATTATAATATTTGT ATAA 

1850 

ATATATAAACATAGATATATTAAATAAAATAACAAATGTACTA 

TATTTGTGCATAAGACGTATACGCTTTATATAATACAACAATA 

TTAATTGTAATAATATTTGTGGTAGTGTGAACACTAAAATTGA 
1950 

TAATAATGATTATAATACAGAAGAAATAAAAAATGAATCCAAT 

2000 

ATAGGATTTACAACAAATATTCATGAAGCAAAAATAATTCAAG 

2050 

AAAAGACATATGGATTAATAATAAACGATAAAATAAAGAAAG 

Fig. 3A e
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2100 

AAGAATATGATGATTGTAATAATAATAATAATAATAATATTAT 

2150 

AATACAGATAAGAGAAGTTGGACTTAATTATTTTGGAGATAC 

TCTCGATGAATCGAATCCATGTAATGATCTTACAGGTATTAA 
2200 

TATATGGGAAAGTTGTCTTGTGGCTAGTCGATGGTTTAGCG 

2250 

ATTTATCTTTACAG AA I I I I 1 1 I I CGAATAAAMTATTTTAGA 

2300 

AATTGGTGCTGGCAGTGGTTTGGCTAGTATAATAATATTTAT 



ATATTCTAATATTTACAA 



Fig. 3A f 
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Sequence of the Pre si- Pre S2 region of pRIT 12792 



PRE-S1 REGION

NcoI
-163

CC ATG GGG ACG AAT CTT TCT GTT CCC AAC
   Met Gly Thr Asn Leu Ser Val Pro Asn

CCT CTG GGA TTC TTT CCC GAT CAT CAG
Pro Leu Gly Phe Phe Pro Asp His Gln

TTG GAC CCT GCA TTC GGA GCC AAC TCA
Leu Asp Pro Ala Phe Gly Ala Asn Ser

AAC AAT CCA GAT TGG GAC TTC AAC CCC
Asn Asn Pro Asp Trp Asp Phe Asn Pro

ATC AAG GAC CAC TGG CCA GCA GCC AAC
Ile Lys Asp His Trp Pro Ala Ala Asn

CAG GTA GGA GTG GGA GCA TTC GGG CCA
Gln Val Gly Val Gly Ala Phe Gly Pro

GGG CTC ACC CCT CCA CAC GGC GGT ATT
Gly Leu Thr Pro Pro His Gly Gly Ile

TTG GGG TGG AGC CCT CAG GCT CAG GGC
Leu Gly Trp Ser Pro Gln Ala Gln Gly

ATA TTG ACC ACA GTG TCA ACA ATT CCT
Ile Leu Thr Thr Val Ser Thr Ile Pro

CCT CCT GCC TCC ACC AAT CGG CAG TCA
Pro Pro Ala Ser Thr Asn Arg Gln Ser

GGA AGG CAG CCT ACT CCC ATC TCT CCA
Gly Arg Gln Pro Thr Pro Ile Ser Pro

CCT CTA AGA GAC AGT CAT CCT CAG GCC
Pro Leu Arg Asp Ser His Pro Gln Ala

Fig. 4A a



PRE-S2 REGION

-55

ATG CAG TGG AAT TCC ACT GCC TTC CAC
Met Gln Trp Asn Ser Thr Ala Phe His

CAA GCT CTG CAG GAT CCC AGA GTC AGG
Gln Ala Leu Gln Asp Pro Arg Val Arg

GGT CTG TAT TTT CCT GCT GGT GGC TCC
Gly Leu Tyr Phe Pro Ala Gly Gly Ser

AGT TCA GGA ACA GTA AAC CCT GCT CCG
Ser Ser Gly Thr Val Asn Pro Ala Pro

AAT ATT GCC TCT CAC ATT TCG TCA AGC
Asn Ile Ala Ser His Ile Ser Ser Ser

TCC GCG AGG ACT GGG GAC CCT GTG ACG
Ser Ala Arg Thr Gly Asp Pro Val Thr

XbaI
AA TAATAATCTAGA

Fig. 4A b



J. 




Figure 1B 
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5* 



Bgl II 



MboII 3'



117 bp fragment from
pRIT12901



Mung bean nuclease 



TTC 
AAG 



I 



GTGC 
CACG 



C7-preS2-S

(metarg 



pRIT10911/BamHI/
mung bean nuclease



PRO 



ATGGAGGAGGAGATATGAGG 
TACCTCCTCCTCTATACTCC 



VAL 

GTGC CC 
"CACG GG 




preS2-S 



pTDH3 



Figure 2B



Hha I - Hind III fragment (230 bp) of pRIT12901



» 




Figure 3B 
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p. Dreesman (Synthetic) 



GAT CTC[GAC AGG 

ag[ctg TCC 

ASP ARG 



AGA TCClcCG 
TCT AGG] GGC CTA G 
ARG SER 



MET ASP LEU ASP 

ATG GAT CTC GAC 
TAC CTA GAG CTG 



\ 



pRIT10911/BamHI 



PRO ASP PRO 

CCG GATCCC 
GGC CTA GGG 



/ 



pTDH3 




preS2-S



Figure 4B 
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pUC12 
SmaI, XbaI




pRIT12793
NcoI

T4 polymerase
XbaI

purify 502 bp fragment 




Figure 5B 
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